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A Molded-Plastic Technique for Connecting 
and Splicing Optical-Fiber Tapes 
and Cables 


By P. W. SMITH, D. L. BISBEE, D. GLOGE, and E. L. CHINNOCK 
(Manuscript received September 25, 1974) 


We describe a new technique for optical-fiber cable connecting and 
splicing. Preliminary tests with multimode fibers produced splices with an 
average loss of less than 0.1 dB and a peak loss of 0.18 dB. 


l. INTRODUCTION 


Despite the efforts of many investigators,'~” the problem of con- 
necting and splicing optical-fiber cables and subgroups of fibers® 
(tapes) has remained a serious one. The continued improvement of 
optical fibers to the point where losses approaching 1 dB/km have now 
been achieved'* has made it increasingly apparent that practical con- 
nectors and splices should have losses much lower than those initially 
considered. 

Someda* demonstrated a splicing technique in which individual 
fibers are aligned by pressing them into a grooved substrate. Much 
subsequent work has involved extensions and improvements of this 
idea. Miller? used precision-grooved aluminum spacers and prepared 
the fiber ends by grinding and polishing. Cherin! used embossed 
grooves and devised a jig for inserting tapes with previously prepared 
fiber ends into these grooves. The lowest losses in splices based on this 
technique were obtained by Chinnock et al.,2 who prepared the ends 
of the fibers using a fiber-fracture technique.’ All these methods, how- 
ever, have drawbacks resulting either from difficulties associated 
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with the preparation of the fiber ends or with the mechanical align- 
ment of the previously prepared ends. These problems may make 
such techniques difficult to apply in the field. 

In this paper we describe a new fiber-splicing technique that elimi- 
nates some of the difficulties associated with the grooved substrate 
type of splices, can function as a removable connector, and should be 
readily adaptable for field use. 


Il. THE MOLDED-PLASTIC SPLICING TECHNIQUE 


The basic technique is illustrated in Fig. 1. The end of the tape to 
be spliced is prepared for molding by dissolving the plastic coating 
over a short region (1 cm) to expose the individual fibers. The tape 
with the exposed fibers is then placed in the mold as shown in Fig. 1a. 
The fibers are held accurately in position by means of a thin spacer 
plate (see insert). After a suitable plastic material is molded around 
the fibers, the entire assembly is removed from the mold (Fig. 1b). 
The fibers are exposed over a narrow region where the spacer plate 
held them in position in the mold. The exposed fibers are now scored, 
and the entire assembly is fractured by bending and applying ten- 
sion In the manner previously described for single fibers.’ The plastic 
material fractures in the same plane as the optical fibers, and the 
tape termination 1s now ready for splicing (Iig. 1c). To make a 
splice, two tapes with terminations prepared as described above are 
placed in an alignment channel, and a suitable index-matching epoxy 
is used to index-match and to hold the assembly together (Fig. 1d). 

The splicing technique described above has a number of important 
features. Each operation is relatively simple and involves no handling 
of individual fibers. The fiber-breaking technique quickly produces 
clean ends of good optical quality. Minimal handling of the prepared 
ends is required. The technique can easily be adapted to make a re- 
movable connection. 

To demonstrate these ideas, a mold was constructed as shown in 
Fig. 2. The mold was machined from brass and was made in two 
parts so that it could be taken apart to facilitate the removal of the 
molded tape ends. The spacer plate was made from 175-um steel feeler 
gauge stock which was tapered to about 100 um at the top. Figure 2b 
shows a close-up of the spacer plate. 

Because polyester resin* is readily available, has a low initial vis- 
cosity, and shrinks little on hardening, it was used as the molding 


” The polyester resin used for these experiments was No. 50111, Berton Plastics, 
Inc., South Hackensack, N. J. 
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Fig. 1—The molded-plastic fiber-tape splicing technique. 
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lig. 2—Photograph of the mold used for these experiments. (a) Overall view. (b) 
Close-up view of precision spacer. 


material, even though it required curing for several hours at ap- 
proximately 50°C. Low-shrinkage plastics are available, however, that 
cure in a short time at room temperature.” 

The optical-fiber tapes used for these splicing experiments were 
made from multimode silicate glass fibers of 120-um outer diameter and 
80-um core diameter. The numerical aperture of these fibers was 
measured to be 0.15. The fibers were made into tapes by dipping 2-m 
lengths of fiber into a solution of plastic material’ and then slowly 


e ‘i example, Facsimile, made by Flexbar Machine Corp., Farmingdale, New 
ork. 


* The coating material used was 3M Kel-F-800, a co-polymer of vinylidene fluoride 
and chlortrifluoroethylene. 
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withdrawing the fibers. In this way, a 25-um plastic coating was ap- 
plied to the fiber. To eliminate crosstalk between fibers, a black dye 
was mixed with the coating material. The plastic-coated fibers were 
then fused into a linear array using the technique previously developed 
by Eichenbaum.'® 

Figure 3 shows a section of five-fiber tape with the plastic material 
removed in preparation for splicing. The material was easily removed 
by applying acetone with a cotton swab. The next step in the splicing 
operation involves placing the prepared tape in the mold shown in 
Fig. 2. To hold the fibers positively in the grooves of the spacer plate, 
a slight tension was applied to the tape by clamping it to the spring 
steel extensions on either side of the mold (see Fig. 2). Because the 
mold we were using was somewhat wider than the tape, we also placed 
pads on either side of the tape to hold the fibers straight as they passed 
over the spacer plate. Such pads would not be required if a narrower 
mold were used. Small pads of balsa wood were also used on top of the 
tape about 1 cm on either side of the spacer to ensure that the fibers 
remained pressed into the spacer grooves during the molding process. 
Polyester resin was then poured around the fibers, the lid of the mold 





Fig. 3—Fiber tape with plastic material removed in preparation for placing in mold. 
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Fig. 4—Fiber tape end after molding polyester resin and before scoring fibers. 
Note exposed fibers. 


closed, and the entire assembly allowed to cure. Figure 4 shows the 
molded section after removal from the mold. To facilitate removal, 
the brass mold was coated with mold-release compound.” 

Two methods for scoring the fibers have been studied. An abrasive 
spray of compressed air and dental abrasive powder was found to be 
effective in scoring the fibers where they are exposed. Alternatively, 
a narrow carbide blade can be drawn across the fibers. After the fibers 
have been scored, the entire assembly is fractured by bending the 
tape and applying axial tension. This can conveniently be done by 
using the device described in Ref. 12. The theory of glass fracture under 
these conditions is derived in Ref. 7. We found that the fracture of the 
molded plastic always took place in the same plane as that of the fibers, 
and the prepared tape ends looked as shown in Fig. 5. 

To complete a splice, the two ends prepared as described above are 
placed in a suitable alignment channel, described at the end of Section 
III, and an index-matching fluid is added. 


"Satisfactory results were obtained with Dow Corning “Pan Shield’’ silicone 
spray. This can be removed easily with acetone. 
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Fig. 5—Tape end after scoring and fracturing. Note the polyester resin breaks in 
the same plane as the optical fibers. 


lll. SPLICE LOSS MEASUREMENTS 


The accurate measurement of small losses in good fiber-tape splices 
requires each fiber of the tape to be excited in the same way and with 
the same amount of light. Repeatable and stable focusing of the input 
beam on a given spot within the fiber core is a prerequisite. To ac- 
complish this, we built the launching assembly shown in Fig. 6, 
which is rigidly attached to a commercially available HeNe laser. 
The assembly not only holds the fiber tape and shields it and the laser 
beam from dust and air movement, but also permits a precise visual 
alignment by way of a built-in microscope. The tape is epoxied to a 
tape holder which slides into a micropositioner from the right of Fig. 6. 
A three-way alignment of each fiber-front face with the focus of the 
launching lens is possible (only one positioning screw is shown). 

The launching lens, the beam splitter, and the eyepiece form a 
microscope arrangement that provides a magnified view of the front 
end of the tape. When the beam is launched into one of the fiber cores, 
the entire core area lights up as a result of back-scattered hight from 
the inside of the fiber. A cross hair built into the eyepiece facilitates 
the alignment of beam focus and fiber core. The beam splitter also 
serves as an attenuator and deflects a portion of the laser beam onto 
a silicon detector that provides a reference signal proportional to the 
input light power. As the circuit diagram of Fig. 7 indicates, both the 
transmitted signal and the reference signal are compared in a ratio 
meter where the result is displayed in digital form. This measuring 
technique, together with the precise visual alignment, was found to 
reproduce the excitation of each fiber to within +0.01 dB over a 
period of at least one hour. 
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Fig. 6—Apparatus used for loss measurements. 
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Fig. 7—Cross section of beam-launching assembly. 


Radiation losses of the above order of magnitude (0.01 dB/m) can 
be incurred in the high-order modes as a result of minute bends along 
the fiber. The bends, and hence the loss, depend on the particular 
placement of the fiber and its holding fixtures, and can change during 
the breaking and splicing operation. To minimize the influence of this 
effect, the focal length of the launching lens is chosen to excite a 
numerical aperture of 0.10 out of the total numerical aperture of 0.15, 
so that some of the high-order modes are not excited. On the other hand, 
these modes may participate to some extent in the transmission (and 
in the splice loss), if long fiber lengths are involved. The resulting 
change in splice loss is expected to be small, but needs to be explored. 

Our splicing tests were conducted by making two molded sections 
approximately 10 cm apart near the center of a 2-m length of tape. 
The same mold in the same orientation was used for each molded 
section. Prior to the splicing test, the transmission of each fiber in the 
unbroken tape was measured using the apparatus previously described. 
The two molded sections were then fractured, the center 10-cm 
section removed, and the two ends replaced in the mold. A drop of 
glycerine was added for index-matching, and the ends of both molded 
sections were held down with a single Teflon* blade about 1 mm in 
width. The transmission of each fiber was again measured. 

Figure 8 is a histogram of the measured losses for tests on 20 fibers, 
and Fig. 9 shows the cumulative loss distribution. In two cases, either 
because of inadequate scoring or because of a weak region in the 
fiber, a fiber fractured at a point other than the desired fracture point. 
These cases have not been included in our data. Also omitted are three 


“ Registered trademark of Dupont Corporation. 
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Fig. 8—Histogram of loss measurement data. 


cases where fibers in one of the tapes were broken during handling. 
The average measuring error for these experiments was found to be 
0.01 dB. No correction for this scatter was applied to the data. 
Permanent splices were made using epoxy instead of glycerine as an 
index-matching material. This did not increase the splice loss when the 
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Fig. 9—Cumulative distribution of measured splice losses. 
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Fig. 10—Completed splice using brass alignment channel and epoxy cement both 
for index matching and for holding the assembly together. 


mold was used as an alignment channel. To make more compact and 
permanent splices, we epoxied molded tape ends into either a channel 
made from 125-um-thick phosphor bronze sheet or a 1-mm-by-0.3-mm 
milled brass channel, as shown in Fig. 10. The resulting loss increase 
of between 0.1 and 0.3 dB was attributed to imperfections of the mold 
that were reproduced in the molded ends. The use of alignment sleeves 
to alleviate this problem is discussed in the next section. 


IV. DISCUSSION AND CONCLUSIONS 


The splice losses reported in Section III are lower than those re- 
ported for other splice techniques®:” and are comparable with the losses 
we have observed using a grooved substrate to align fibers with 
previously prepared ends.” We believe, however, that the technique 
described in this paper has a potential for adaptation to a variety of 
fiber-connector problems and to field splicing of fiber cables. 

There are two ‘precision’ operations involved in our splicing 
technique, (z) the intial placing of the prepared fiber tape in the mold 
and (22) the placing of the molded ends in the holder. Suitably designed 
molding tools and alignment sleeves should facilitate these operations 
to the point where they require no special skills. One important 
problem is that of maintaining a high degree of cleanliness. If the 
same mold is used to prepare both ends to be spliced and this mold 
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is also used as the alignment channel, a high-precision mold is not 
required. If, on the other hand, the prepared ends are to be joined in a 
different alignment channel, precision molds must then be used. The 
use of precision molds does not present a major problem, however, for 
the molds can be made, tested, and cleaned in advance, and these 
molds can, at a later time, be prepared for reuse. In this case, separate 
molds would be used to prepare each tape end; we thus avoid the 
problems inherent in the first-described technique associated with 
cleaning a mold in the field. 

A definitive design of the alignment channel is beyond the scope 
of this work, since this design must depend considerably on the 
constraints of a specific connection: a removable or permanent con- 
nection of one fiber tape inside a building, for example, has different 
requirements from a cable field splice. However, some guidelines 
common to all designs can be extracted from our work: the reference 
surfaces used for the alignment of two molded terminations should be 
narrow and well defined to avoid misalignment as a result of mold 
irregularities and contamination. Thus, rather than using the channel 
of Fig. 10, which encloses the terminations on three sides, a sleeve that 
makes contact with the terminations only along their narrow sides 
may be more practical. More specifically, the sleeve might comprise 
two V-grooves designed to accept the V-shaped narrow sides of ter- 
minations molded in the shape shown in Fig. 11. The width of the 
V-grooves should be somewhat wider than the thickness of the ter- 
minations so that contact is only made along the grooves and not at 
the top and bottom surface of the molded piece. Besides minimizing 
and defining the alignment surfaces, this approach permits a space- 
saving and simple arrangement of stacks of terminations in the case 
of a fiber cable, as illustrated in Fig. 11. This figure shows two grooved 
chips spring-mounted opposite each other inside a cartridge that aligns 
the terminations and serves as the inside housing and protection for 
the cable splice. 

This approach, together with the molding technique, seems partic- 
ularly well suited to produce removable connections. In this case, the 
index-matching epoxy or liquid would be advantageously replaced by 
a gel that can easily be removed when the connection is dismantled. 
When such terminations are prefabricated in the factory, the tech- 
nique proposed here offers the additional advantage that the termina- 
tions can be delivered unfractured, so that the end faces remain pro- 
tected until they are fractured on site shortly before the connection is 
made. 

In conclusion, we have described and demonstrated a new splicing 
technique for optical fiber tapes that yields an average splice loss of 
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Fig. 11—Grooved alignment sleeve for cable splicing and connectors. 


less than 0.1 dB. This technique involves no handling of individual 
fibers and alleviates some of the difficulties of fiber-end preparation and 
mechanical alignment of previously prepared ends that are encountered 
with other techniques. 

An adaptation of our technique for field use would involve develop- 
ing a molding technique applicable at room temperature with a quick- 
setting plastic encapsulating material and the design of molding tools 
and alignment sleeves to facilitate the joming operation. 
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Coupled-Mode Theory for Anisotropic 
Optical Waveguides 


By D. MARCUSE 


(Manuscript received December 11, 1974) 


The well-known coupled-mode theory of waveguides is extended to in- 
clude dielectric guides made of anisotropic materials. Exact coupled-wave 
equations for anisotropic dielectric waveguides are derived, and explicit 
expressions for the coupling coefficients are given. The coupling coefficients 
for isotropic waveguides are obtained as a special case. A simple approxi- 
mation for the coupling coefficients in the case of slight anisotropy and 
slight departure from an tdeal waveguide 1s presented. 


l. INTRODUCTION 


The theory of dielectric optical waveguides deals with electro- 
magnetic wave propagation in optical fibers and in the waveguides 
used for integrated optics. Wave propagation in these structures is 
described in terms of normal modes.'!~* However, normal modes pre- 
serve their identity only in perfect waveguides without irregularities 
of either the refractive index distributions or the waveguide geometry. 
Electromagnetic wave propagation in waveguides with any kind of 
irregularities must be described by means of coupled-mode theory.*4 
The electromagnetic waves in imperfect waveguides are expressed as 
superpositions of all the modes of a perfect waveguide. The mode 
amplitudes are coupled together by coupling parameters that depend 
on the nature of the waveguide imperfections. A description of wave 
propagation by means of coupled-mode theory allows calculation of 
radiation losses caused by intentional or unintentional fluctuations of 
the refractive index along the axis of the waveguide or by core-cladding 
boundary fluctuations.?:? Coupling among guided modes is used to 
design modulators or distributed feedback circuits for lasers or to 
effect improvements in the multimode dispersion properties of over- 
moded waveguides. The coupled-mode theory is well developed for 
waveguides that consist of isotropic dielectric materials. Some work 
has been done to extend this theory to waveguides consisting of aniso- 
tropic materials.°-? These waveguides are assuming increasing im- 
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portance in integrated optics as methods are being perfected for 
fabricating waveguides by diffusing different dopants (or outdiffusion 
of certain component atoms) into anisotropic crystals.°-" 

This paper describes the derivation of coupled-wave equations for 
the modes of waveguides consisting of anisotropic materials. The 
coupled-wave theory is based on the definition of guided and radiation 
modes as solutions of Maxwell’s equations for idealized structures. An 
orthogonality relation is derived that is needed to isolate individual 
terms in the infinite series expansion of the electromagnetic field. The 
principal result of this theory is the derivation of coupling coefficients 
that are important for solving coupled-mode problems. Readers not 
interested in the derivation should look at eqs. (46) and (48). Applica- 
tions of this theory are not presented here, since they will be the sub- 
ject of further publications. 


ll. THE FIELD EQUATIONS FOR ANISOTROPIC MEDIA 


The derivation of coupled-wave equations for anisotropic dielectric 
waveguides follows closely the procedure used for deriving coupled- 
wave equations for isotropic waveguides.* The objective of coupled- 
wave theory is to construct solutions of Maxwell’s equations for wave- 
guiding structures consisting of general refractive-index distributions. 

Anisotropic media are characterized by a dielectric tensor, 


€e— |] €yzx Cyy Cyz 7° (1) 


We assume that the elements of this tensor are real quantities char- 
acteristic of lossless materials. It can be shown that conservation of 
energy requires that the dielectric tensor form a symmetric matrix 
so that the following relations hold :" 


€xy = Gyn; Exe = €Ezay €yz = Ezy. (2) 


The magnetic properties of the medium are assumed to be the same 
as that of a vacuum so that we use the (isotropic) magnetic permea- 
bility constant wo. Maxwell’s equations for anisotropic media assume 
the form — 

V XH = we:-E (3) 


V XE = — twwoH. (4) 


It was assumed that the electric field vector E and the magnetic field 
vector H have the time dependence, 


err, (5) 
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The tensor notation «-E may be expressed in component form as 
(e-E); = e:;f;. (6) 


Summation over double indices is understood, and the subscripts 2 
and j assume the values 1, 2, and 3 that represent the zx, y, and 2 
components of the vector E or tensor e. 

Derivation of coupled-wave equations for isotropic media is facil- 
itated by expressing the longitudinal components of E and H in terms 
of the transverse components.’ This practice is preserved for our deriva- 
tion of coupled equations for anisotropic media. We single out the z 
coordinate as the direction of the waveguide axis and express the field 
vectors and the differential operator V as superpositions of transverse 
and longitudinal parts. The symbol ¢ indicates the transverse direc- 
tions x and y. Thus, we have 


E=E,+E., (7) 
H = H, =e hz, (8) 
and 
F 
aU aa or (9) 


We use the notations e,. e,, and e, to indicate unit vectors in 7, y, 
and z directions. 

The transverse part of the vector «-E is indicated by the notation 
é,-E or, in component notation, 


én°E = @2(€srlic + €xyliy + €xzllz) (10) 

ey E = ey (eysBs + €yyHy + eyel 2). (11) 
The longitudinal part is 

é2°E = @2(és2 cs + €zyMy + €zelt,). (12) 


We may now separate Maxwell’s equations into transverse and longi- 
tudinal parts. The transverse parts of (8) and (4) are 


V.xXH.t+e:z X ae = 1we,-E (13) 
V.x E. +e. X oe = — 1wpyoH;. (14) 


Their longitudinal parts may be written as 


V. X Hi = w(e.:E, + €2-Ez) (15) 
and 
Vi XE, = — twuoH.. (16) 
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The longitudinal parts of E and H follow immediately from (15) and 
(16), 





E. se I VV: x H, = EZ Poe OF (17) 
LWEz2 Ezz 
and 
Sa Se, (18) 
tadpo 


On the right-hand side of (17) and (18) appear only transverse com- 
ponents of E and H. It is important to distinguish between the single 
and double subscript notation of «. A double subscript, like e,,, indi- 
cates a single tensor element of e, while a single subscript, like e., is 
defined by (10) through (12). In particular, we have 


€,-Ei = @2(€sr2le + €zyl,). (19) 


We now use (17) and (18) to eliminate the 2 components of E and H 
from the transverse parts of Maxwell’s equations (18) and (14), 


we aie Ey he, eo 
two 02 
: ] 
= we, Ey = = €,°€,'E; + an Er" (V; x H,) (20) 


22 


and 
] OE, 
Vi x Vv: x H, o- cE | + e, x ae a twp oH. (21) 





LWEx2 


These two vector equations represent four scalar equations. Once 
eqs. (20) and (21) are solved, the 2 components of E and H can be 
obtained by simple differentiation from (17) and (18). We have thus 
achieved a simplification of the original problem by reducing the num- 
ber of equations from six, in (3) and (4), to only four. 

The components of the e tensor are assumed to be functions of 2, y, 
and z. The e« tensor defines the wave-guiding structure. Because of the 
z dependence of e, eqs. (20) and (21) do not have mode solutions. A 
normal mode is defined as a solution of Maxwell’s equations whose z 
dependence can be expressed by the simple function 


en Bz. (22) 


Such solutions exist only if the dielectric tensor does not depend on 
the z coordinate. To construct solutions of the general eqs. (20) and 
(21), we consider solutions of simpler equations that are defined by 
a tensor é that is similar to e but is independent of z. The choice of é 
is obviously arbitrary and is determined by convenience. Using (22), 
we find from (20) and (21) the following equations for the normal 
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modes of the waveguide structure defined by é: 


1 ; 
— —V.xX (Vi X € ?)) — 78”e, KX HY 
LW 0 


a tw | I, 
= 1WEt° Eo = z.. Ep 2° GP ae os €;* (Vv, x Jey? (23) 
Zz 


28 


and 





1 1 
Vix a VEN Gy 2.89? | 


LWEz2 €zz 


— 18Me, X EP = — topo. (24) 


The subscript v indicates a mode label. Equations (23) and (24) admit 
an infinite number of solutions with different eigenvalues (propagation 
constants) 8% and different field vectors &” and #”. Script letters 
indicate mode fields, while roman letters E and H are reserved for gen- 
eral field distributions. The modes are of two different types, guided 
modes whose fields are confined to the vicinity of the waveguide and 
radiation modes that extend to infinity in transverse direction to the 
guide.?:? Guided modes have discrete eigenvalues 8%’, while the eigen- 
values of radiation modes form a continuum. The superscript (p) 
stands for either (+) or (—), depending on the direction of wave 
propagation. A wave traveling in the positive z direction has positive 
(real) values 6S“, a wave traveling in the negative z direction has a 
negative (real) value 8S. In isotropic media, we have the simple 
relations, 


Bf? = — Bo, (25) 
So 8, so aS Se, (26) 

and 
eo) = —9e, gel = ae, (27) 


General anisotropic media are more complicated, so that (26) to (27) 
do not apply. Modes traveling in one direction may be different from 
modes traveling in the opposite direction. 


Ill. ORTHOGONALITY RELATIONS 


The modes of anisotropic dielectric waveguides are mutually 
orthogonal.? For the purpose of deriving orthogonality relations, it is 
simpler to use Maxwell’s equations in the form (3) and (4) instead of 
the form (23) and (24). Separating the z derivatives from the V 
operator, we write (3) for a mode labeled » and (4) for a mode labeled u, 


Vix @R” — Be, XK RH” = twe- Ei?) (28) 
and 
V.X &© — pe. X &9 = — two. (29) 
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Next, we take the complex conjugate of (28), multiply the resulting 
equation by &”, multiply (29) by —3¢e”, add the two equations, and 
Integrate over the infinite cross section (€ is assumed real) : 


[ [tse-v. x 5 ()* -_ IP. , x 6 
+ 1B 2"8 -e, X HP” + 1BMICP”-e, X EM }dady 


= — tof [[ee-e-8e" — wsee*-serydrdy. (80) 
The first two terms on the left-hand side of (30) can be expressed as 
= | | Vi (89 X 3")dedy = — / (8 X 5¢")-nds. (31) 


The two-dimensional divergence theorem was used to convert the 
integral over the infinite cross section in the z-y plane to an integral 
over the infinite circle with outward normal direction n and line ele- 
ment ds. The integral on the right-hand side vanishes if at least one 
of the two modes is a guided mode. If both modes » and yp are radia- 
tion modes, the integral vanishes in the sense of a delta function of 
nonzero argument.” Using this fact and a well-known vector identity, 
we can express (30) as 


ON) i: : e.-(& X 3”) dxdy 
= cS of [lee en, pdt? -3C” |dady. (32) 


Because of the symmetry of the é tensor, the following relation holds: 
EP".2-8O = 6.2.80", (33) 


We take the complex conjugate of (82), interchange the superscripts p 
and q as well as the subscripts » and y» and, using (33), subtract the 
new expression from (32) with the result: 


CB Gr) / J e. [8% XK 5" + §?* XK HM ]drdy = 0. (34) 


Equation (34) is the desired orthogonality relation. It is obvious that 
this expression holds also for isotropic media. However, in the iso- 
tropic case it is possible to use (25) through (27) to prove that each 
term in (34) must vanish separately.” For the general anisotropic case, 
(34) cannot be simplified further. We infer from (34) that the integral 
vanishes if 8 — B* + 0. This means that the integral vanishes even 
in the case »y = uw if p and q indicate opposite signs, and a wave is 
orthogonal to its backward traveling counterpart (if 6 is real) if 
orthogonality means vanishing of the integral in (34). 
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The integral in (34) expresses the total power flow if v = uw and 
p = q. We may therefore use the orthonormality relation, 


J [esCep x sep" + 8" x se Jandy 
Be? + BP" 
18s | 


to express mode orthogonality and normalization. The subscripts ¢ 
indicating the transverse parts of the modes were added since the z 
components of the fields do not contribute to (35). P is a normalizing 
factor common to all modes that is used to adjust the arbitrary ampli- 
tudes of the normal modes. For real values of 8, we have 


gM = 1, (36) 


= 2s( Pé,,, (385) 


In this case, the sign of the integral is expressed correctly by the fact 
that Bf reverses its sign if g goes from (+) to (—). For opposite signs 
of p and q, the right-hand side of (35) vanishes as required by (34) 
if BS is real. For imaginary 8, (35) vanishes for g = p. The orthog- 
onality relation also holds for imaginary values of 8. Imaginary 
values of the propagation constants occurs only for evanescent “‘radia- 
tion’’ modes.?:3 In the case of imaginary 8, the sign of the right-hand 
side of (35) is not certain. For this reason, we have introduced the factor 
sf? that must be adjusted so that P is a positive real quantity. This 
means that s” may have to be negative, s = — 1. However, this 
case can arise only in connection with evanescent “radiation”? modes. 
The 6,, symbol in (35) indicates Kronecker’s delta if both modes are 
guided. When one mode is guided while the other is a radiation mode, 
we have 6,, = 0. If both modes are radiation modes, 6,, must be 
interpreted as the Dirac delta function. 


IV. DERIVATION OF COUPLED-WAVE EQUATIONS 


Any arbitrary field distribution compatible with Maxwell’s equations 
can be expressed as the superposition of all the modes of the idealized 
structure defined by the dielectric tensor ¢. Because the complete set 
of modes consists of a finite number of guided modes plus a continuum 
of radiation modes, we express the transverse parts of a general field 
by the expansion 


E,= Darepr +O f° ar(p)8i”(o)dp (37) 
vip Pp 0 
and 
H, = DaMse? +> | a’ (p)5e(p)dp. (38) 
v,p Pp 0 
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The longitudinal parts follow from (17) and (18). The superscripts 
assume the values (+) and (—) indicating waves traveling in positive 
and negative z direction. The first terms in (37) and (388) represent the 
contribution of the finite number of guided modes labeled v. The 
second terms indicated combinations of sums and integrals. The 
integration ranges over the entire region of continuous-mode labels p 
and includes radiation modes with real as well as imaginary values of 
6°?) (p). The summation symbol in front of the integral sign indicates 
that, in addition to modes traveling in positive and negative z direc- 
tion, various types of radiation modes exist and must be added to 
obtain the complete set of modes. For the purpose of deriving coupled- 
wave equations, the notation of (37) and (88) is too cumbersome. We 
use an abbreviated notation by omitting the integration sign, leaving 
it understood that the summation symbol includes summation over 
guided modes and summation as well as integration over radiation 
modes. We thus write 


E, = Dame (39) 
v,p 
and 
H, = dD) afPae?. (40) 
vp 


é& and 3c!) are independent of z, but a” is a function of z. Substitu- 
tion of (39) and (40) into (20) and (21) and use of the mode eas. 
(23) and (24) leads to 


(p) 
» (S Ee iray Vee. x 5) 


v,p 


1 1 
= > a?) ji = Ex)» & 0? — WwW ( — Cj bg = 6 63 Eo? 
v,p 


Ezz €zz 
(dens) oxa 
and 
to (D) q (P) (p) 
me + 1B, a,” )(ez X &3?) 
v,p 
=— >a” Vix a (2-2) ¥ x se 
v,p : 1a €zz Ezz : 


— (= 6--4): sp |}. (42) 
Ezz Ezz 


We take the scalar product of (41) with — &{?* and of (42) with 3¢9", 
then we add the two equations, integrate over the infinite cross section, 
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and use the orthogonality relation (85). The result of this procedure is 


Bo | Bo ( r) Pay) 
D7) Bt P Me 
= Tel 


= » oie hee -(e; fe é,)° Eg? 
v,p 


F * €¢° Ez €4° Es * Et Ey 
+ iwsyr.( Ee — £2), gp — gor. — EB ).(y, x sep 


Ezz Ezz Ezz Ezz 








— 300"-V, X | e(=- =)¥. x se? 
= (= 2 &). 09 ||axay. (43) 


On the left-hand side of (48), we have used a superscript (r). This 
notation is necessary to distinguish between the case of real and 1m- 
aginary propagation constants 6. If 6 is real, we have r = q. If 
8 is imaginary, as it is for evanescent radiation modes, we must choose 
for + the sign opposite to g. We now write (43) in the abbreviated form 


(7) 
da, 
dz 





= — i8Pa? + D KEPa”. (44) 
v,p 


The coupling coefficient is defined by (43). We may eliminate the 
transverse magnetic-mode field vector from the coupling coefficient 
by using (17) (applied to the mode field) and the identity (which is 
obtained by partial integration), 


| | ge". (W, X F)dxdy = | J (v, X 3¢2")-Fdrdy. (45) 


The coupling coefficient can be expressed as 
: Iw | BS | Ep: €z Er E 
ee = 45 8"P i EO". — ? — _ (e - Et) - Ei? 
— Ef". €zz ep = & -§ + §{) 
Ezz = ‘ 


+ (é,-89" + wie — eat Ef?) +- e?) 


€zz 


(= -- #). esp | ay (46) 
€zz ZZ 


V. IMPORTANT SPECIAL CASES 


In its complete form, (46), the coupling coefficient is very compli- 
cated. For isotropic media, where the e tensor degenerates to a multiple 
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of the unit tensor, (46) simplifies to the exact form, 


(7) Z 
Kin = geirtawp | [ (c- | seep + ee" 2m | ardy, (47) 
in complete agreement with the well-known result.” 

In many practical applications, the anisotropy of the dielectric 
medium is only slight, and the difference between the actual dielectric 
tensor « and the ideal tensor é is small. In that case, (46) can be sub- 
stantially simplified. A reasonable approximation of (46) for slight 
anisotropy and small values of « — é is 


(r) 
KS naw i / EO". (e — &)- 8M dady. (48) 
Note that the whole vectors of the electric mode fields enter (48) and 
not just the transverse or longitudinal parts. The approximation (48) 
is obtained by considering off-diagonal elements and differences be- 
tween diagonal elements of e and é as quantities that are small of first 
order. Products of two first-order quantities have been neglected. For 
readers who did not follow the detailed derivation, we repeat here 
briefly the definitions of symbols appearing in (46) through (48). The 
symbol w is the angular frequency of the electromagnetic field. The 
script symbols 8&{” indicate the electric-field vectors of normal modes 
of an idealized waveguide that is defined by the dielectric tensor 
é = é(x, y), the subscript v is a mode label, and the superscript (p) 
stands for either (+) or (—), indicating the direction of wave propaga- 
tion. The propagation constants 6{” of the modes are labeled in the 
same way as the field vectors. The superscript r is usually identical 
with the superscript g. Only in the case of imaginary 6” (this case _ 
happens for coupling to a nonpropagating radiation mode and is of 
little practical interest) does r indicate the sign opposite to q. Likewise, 
s\? = 1 for most cases of interest. Only for imaginary values of 8{” may 
it become necessary to choose s{? = —1 to keep the power normaliza- 
tion coefficient P positive in (35). The asterisk indicates complex con- 
jugation. Subscripts ¢ and z occurring in (46) and (47) refer to the 
transverse and longitudinal parts of the vectors to which they are 
attached. Similar subscripts attached to ¢ are defined by (10) through 
(12) and (19). The dielectric tensor « = e(x, y, 2) defines the actual 
waveguide (in contrast to the ideal guide that is only a mathematical 
fiction). The integrals are extended over the infinite transverse cross 
section of the guide. Equation (48) assumes the same limit as (47) if 
the dielectric tensor degenerates into a multiple of the unit tensor 
since, in the spirit of the approximation (48), we must use é,./e2.2 = 1. 
Equation (48), even though it is only an approximation, is likely to be 
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of most importance in practical applications because of its simple form. 
For many practical problems, the approximation is Justified and leads 
to sufficiently accurate results. 


VI. CONCLUSIONS 


We have derived coupled-wave equations representing exact solu- 
tions of the electromagnetic field problem for dielectric waveguides 
that consist of anisotropic materials whose dielectric tensor is a function 
of the z coordinate. The field of the general waveguide is expressed in 
terms of ideal modes of a hypothetical dielectric waveguide defined by 
a dielectric tensor whose elements are independent of the z coordinate. 
The main result of this paper is the expression (46) for the coupling 
coefficients. For many practical applications, the exact coupling coeffi- 
clent-can be approximated in the simple form (48). 

The coupled-mode theory for anisotropic dielectric waveguides is 
essential for the solution of problems of mode propagation in integrated- 
optics guides with random or systematic irregularities. A particularly 
important area of applications are guides that are made anisotropic 
by an externally applied dc voltage or whose anisotropy is changed by 
such a voltage. Instead of an applied voltage, an acoustical wave may 
cause an anisotropic change of the refractive index of a dielectric wave- 
guide. These cases cannot be handled by the simpler isotropic coupled- 
mode theory, but require the extension to anisotropic media presented 
here. 
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Apparent changes tn the coercivity and remanence of the magnetic reed 
material used in the remreed sealed contact were encountered during the 
glass-to-metal sealing operation. By using primarily metallographic ob- 
servations correlated with magnetic data, it was determined that these 
changes were due to a modification of the G00°C-aged microstructure caused 
by the time/temperature cycle experienced in sealing. The room-tem- 
perature stable precipitates that are developed by the GOO°C anneal to 
produce magnetic hardening are dissolved, and the alloy converts to a two- 
phase duplex body-centered cubic (Bcc) microstructure along one-half to 
two-thirds of the reed shank. This produces a substantial increase in co- 
ercive force and some decrease in remanence. The structure and properties 
of the alloy before sealing have little influence on the reaction. 


I. INTRODUCTION 


A recent effort in the telecommunications industry has been the 
development of a remanent reed, sealed contact (remreed).! One 
problem encountered in this development was an apparent change in 
the magnetic properties of the Remendur (Fe, Co, V alloy) reed during 
the glass-to-metal sealing operation. During sealing, the reed can reach 
peak temperatures in excess of 1050°C and may be at temperatures in 
excess of 900°C for as long as 8 to 10 seconds. Preliminary data supplied 
to the authors indicated an increase in the coercivity of reeds from the 
sealing operation,? whereas the results of Kitazawa, Oguma, and Hara 
showed a 22-percent decrease in coercivity at switch sealing.’ There- 
fore, a description of the exact nature of this change and an explana- 
tion of the mechanism by which it occurs were sought. Also, informa- 
tion on the possible influence of variations in the time/temperature 
sealing cycle on the magnitude of this change was considered relevant 
to determine if improvements could be achieved by such modifications. 


997 


This paper summarizes the results of both a laboratory study on the 
Remendur alloy and evaluation of manufactured contacts to ascertain 
the mechanism for the reported property changes. The normal sealing 
operation was also modified by increasing the sealing speed and reduc- 
ing the sealing-lamp temperature to obtain data on possible modifica- 
tions to minimize the changes. Also presented are data on the influence 
of microstructure from the prior strand-anneal and 600°C-aging 
anneal-heat treatments on the stability of the aged magnetic properties. 


ll. EXPERIMENTAL PROCEDURES 


The evaluation on actual sealed reeds was carried out on Western 
Electric Company assembled contacts. A group of reeds stamped from 
a single coil of commercially melted and processed 0.535-mm Remendur 
wire was aged at 610°C in hydrogen for two hours. Some reeds of this 
group were assembled into contacts at two separate manufacturing 
facilities. Samples of the aged reeds and reeds that had been removed 
from these assembled contacts were supplied to the authors for evalua- 
tion (Experiment A). Additionally, contacts that had been sealed at an 
accelerated speed were provided (Experiment B). Finally, a two-group 
sample was supplied consisting of six typical contacts and six contacts 
sealed with a 100-W reduction in power on the sealing lamps (Experi- 
ment C). 

The laboratory study was carried out on a selection of coils of wire 
from another commercial melt of Remendur (48.7-percent Fe, 48.2- 
percent Co, 2.90-percent V by weight with balance impurities Mn, 
Ni, C, 8). The wire was supplied in a 900 to 950°C strand-annealed and 
quenched condition. Samples from the front and back ends of every 
coil from this melt were examined metallographically. A variety of 
microstructural variations were detected in this annealed wire. These 
ranged from a nearly-all-a,; phase structure produced by an anneal at 
the lower extreme in the temperature range to an all-az phase structure 
produced by an anneal at the upper temperature extreme.*® A sample 
of 15 front and back coil ends covering this range in structures of the 
0.535-mm strand-annealed wire was selected. The wires were aged at 
620°C for three hours in an argon/10-percent hydrogen-gas mix- 
ture. Each of these coil ends was then divided into three separate 
lengths of wire. Each wire length was subjected to one of three differ- 
ent high-temperature exposures. These exposures were carried out by 
inserting the wires for a fixed length of time into an open-air muffle 
furnace set for a fixed elevated temperature. They were gripped in 
asbestos-covered tongs for the insertion to prevent the tongs from act- 
ing as a heat sink. The three time-temperature cycles used on the wires 
are shown in Figs. 1, 2, and 3. These profiles were produced by insert- 
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Fig. 1—Time/temperature thermocouple response for a 12-s insertion into a 1200°C 
furnace. 


ing a chromel-alumel thermocouple made from 0.535-mm wire into the 
furnace in the same manner as the Remendur wire samples and moni- 
toring the response on a strip-chart recorder. These cycles were selected 
to nearly duplicate typical sealing, reduced-temperature, and ex- 
tended-time cycles, respectively. The typical cycle was determined 
from a previous study on the sealing of 237-type ferreed contacts. ° 
The production reeds and the laboratory samples in each heat- 
treated condition were evaluated metallographically and the magnetic 
parameters were determined. Metallographic preparation was routine, 
using a 5-percent Nital etch for 15 to 30 seconds. Structures were 
observed by optical light microscopy at 1500 magnification using 
Nomarski Differential Interference Contrast (pic). All magnetic data 
were provided to the authors by E. C. Hellstrom. Coercivity and 
remanence were measured from full-loop traces on samples magnetized 


SEALING OF REMENDUR- 999 


REDUCED — TEMPERATURE 
CYCLE (2) 





ee ee 


54 48 42 36 30 24 18 12 6 0 
SECONDS 


Fig. 2—Time/temperature thermocouple response for a 12-s insertion into a 1050°C 
furnace. 


by an applied saturation field of 300 oersteds. All magnetic values pre- 
sented are the average of multiple samples. The laboratory samples 
were 2.5 cm in length, and the production samples were actual reeds 
possessing their characteristic geometry (Fig. 4). The reeds were mea- 
sured both as complete reeds with the search coil centered over the 
paddle portion and also as dissociated paddles and shanks. 


lll. RESULTS AND DISCUSSION 


Microstructural comparisons and correlations were obtained on 
longitudinal sections of the shanks. Figure 5 shows the microstructures 
present in a typical reed shank after sealing, at approximately 1-mm 
increments along its length. It is apparent that, in this case, beginning 
at the paddle end, approximately the first 5.6 mm have undergone a 
structural change. This represents approximately two-thirds of the 
shank length. This heat-affected region is substantially larger than 
that actually encompassed by the glass in the seal. 
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Fig. 3—Time/temperature thermocouple response for a 25-s insertion into a 1200°C 
furnace. 


3.1 Laboratory-simulated sealing-cycle exposures 


To determine the significance of this structural change, the lab- 
oratory experiment utilizing various elevated-temperature exposures 
was carried out. The magnetic-properties data for the 15 selected coils 
are summarized in Table I. Of the 15 coils selected, 11 are considered 
to possess a microstructure at least approximating a nearly equal dis- 
tribution of the two phases, a1 + a2, in this duplex Bcc structure.’ 
This distribution is expected from an optimized-temperature strand 
anneal and quench.‘ In this strand-annealed state, these typical wires 
indicate a coercive force of 41 to 47 oersteds and a remanence of 21 
to 23 maxwells for the specified measuring conditions. Three samples 
designated 208B, 240B, and 268B were strand-annealed at the low 
temperature extreme, i.e., less than about 900°C, and have a nearly 
single-phase structure of a; (Bcc). They have a significantly lower co- 
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Fig. 4—Reed geometry used for magnetic evaluation. 


ercive force with little difference in remanence when compared to the 
two-phase strand-annealed structure before aging. Finally, one sample 
designated 223B was strand-annealed at the upper temperature ex- 
treme, 1e., greater than about 950°C, and has a nearly single-phase 
structure of a2 [the vanadium-supersaturated, Fcc (face-centered 
cubic) to Bcc transformation phase ].7 It shows not only a modestly 
reduced coercive force but a significantly lower remanence when com- 
pared to the two-phase strand-annealed structure. Representative 
microstructures for these three conditions are shown in Fig. 6, column 1. 

Again referring to Table I, it appears that the 620°C-aging anneal 
has a tendency to normalize the magnetic properties. This is primarily 
a consequence of the precipitation of stable y (Fcc) phase,® relief of 
the y — a. transformation strains,’* and ordering of the a; matrix. 
The four high- and low-temperature strand-annealed coils are still 
below the mean for the two-phased strand-annealed coils of 24 to 25 
oersteds in coercive force and of 26 to 28 maxwells in remanence after 
aging. Representative microstructures in the 620°C-aged condition 
are also given in Fig. 6, column 2. 
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Fig. 5—Microstructures along the length of a typical sealed reed shank—etched 5-percent nital, 
approximate end of the heat-affected zone. 
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Table | — Magnetic properties of Remendur wire before and 
after simulated sealing cycles 1, 2, and 3 


He (oersteds) 


Strand- 

Strand- Aged Typical Low T Long t 
Sample | Anneal | Annealed 620° 63 hrl| Cycle (1) | Cycle (2) | Cycle (3) 
205 T 2 phase 41.4 24.3 32.8 16.6 28.6 
205 B 2 phase 40.9 22.2 30.6 220k 27.5 
208 B | Primarily ai 29.9 17.3 34.3 13.9 27.8 
218 T 2 phase 46.6 29.0 35.4 — 29.1 
218 B 2 phase 44,7 29.0 36.1 16.0 26.5 
223 T 2 phase 42.6 23.8 37.4 19.7 28.8 
223 B | Primarily a2 39.0 PAE | 33.0 14.2" 27.7* 
231 T 2 phase 45.4 29.6 38.9 16.9 29.9 
231 B 2 phase 47.3 27.3 38.9 18.3 28.9 
236 T 2 phase 38.5 24.3 37.1 — — 
236 B 2 phase 43.5 26.4 36.3 14.5 26.3 
240 T 2 phase 44.6 20.8 37.3 17.1 28.3 
240 B | Primarily a 23.0 19.0 37.3 13.9 29.2 
268 T 2 phase 46.7 27.0 38.9 17.8 27.4 
268 B | Primarily a 21.0 19.0 37.0 — 29.5 


On (maxwells) 


Strand- 

Strand- ged Typical Low T Long t 
Sample | Anneal | Annealed 620° 8 br Gyele (1) | Gycle (2) | Cycle (3) 
205 T 2 phase 22.2 27.2 16.3 21.9 11.1 
205 B 2 phase 22.3 23.2 18.1 24.2 11.3 
208 B | Primarily a; 22.1 21.4 16.5 20.6 11.0 
218 T 2 phase 22.9 27.9 17.3 — 12.9 
218 B 2 phase 22.6 27.5 18.2 22.6 11.8 
223 T 2 phase 22.3 25.2 19.1 21.8 12.2 
223 B | Primarily a2 17.3 20.2 15.1 9.3" 7.6" 
231 T 2 phase 22.8 28.3 20.2 22.9 11.3 
231 B 2 phase 22.9 28.4 20.2 24.6 13.4 
236 T 2 phase 21.9 27.7 19.6 — — 
236 B 2 phase 224 26.6 19.1 19.5 12.0 
240 T 2 phase VIE | 25:4 19.0 23.0 11.4 
240 B } Primarily a; 20.6 20.8 19.5 19.3 122 
268 T 2 phase 23.1 27.4 20.6 22.9 11.2 
268 B | Primarily a; 20.3 21.2 19.6 — 11.9 


Material: 2.90 weight percent V (7) T = front of coil, (#2) B = back of coil, (iz) 
Happ = 300 oersteds, (2v) Values average of two or three samples, (v) Sample length 
= 2.5 cm. 

* Aged at 515°C prior to elevated temperature exposure; 1.6-cm sample length. 


Samples were exposed to the typical-sealing cycle given in Fig. 1, 
which was based on a previous experimental evaluation of the time- 
temperature cycle experienced by a 52-alloy reed in the sealing of a 
type 237 (nonlatching, Ni-Fe) reed contact. This cycle creates an 
exposure to a peak temperature of 1050°C and temperatures in excess 
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lig. 6—Maicrostructures in strand-annealed and 620°C-aged conditions for two- 
phase (231T), primarily a:-phase (240B), and primarily a2-phase (223B) strand- 
annealed structures—etched 5-percent nital, DIC, 1500X. 


of 900°C for approximately 7 seconds. This exposure produces a 
marked increase in coercive force into the range of 33 to 38 oersteds and 
a drop in remanence to 15 to 20 maxwells (Table I) for all samples. 
Representative microstructures given in Fig. 7, column 1, indicate 
that this change is a result of the material altering to the structure 
typically produced by the 900 to 950°C strand-anneal.* That is, the 
y (Fcc) precipitates are dissolved and portions of the matrix return to 
the high-temperature Fcc phase and again undergo the Fcc — BCC 
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TYPICAL—SEALING REDUCED—TEMPERATURE EXTENDED—-TIME 
CYCLE (1) CYCLE (2) CYCLE (3) 
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*AGED AT 515°C; SAMPLE LENGTH = 1.6cm 


Fig. 7—Microstructures following various elevated-temperature exposures on 
samples shown in Fig. 6—etched 5-percent nital, DIC, 1500X. 


transformation on cooling. This produces a highly strained lattice and, 
hence, the increased coercive force values. Similar results were obtained 
by Mahajan and Olsen” by exposing 600°C-aged Remendur to tem- 
peratures of 850, 950, and 1050°C for 5 seconds and measuring the 
magnetic changes as well as observing the structural variations utilizing 
transmission electron microscopy. 

A slightly reduced temperature cycle was attempted to ascertain 
if a modest reduction in sealing temperatures could alleviate this 
change. This cycle, given in Fig. 2, had a peak temperature of 930°C 
and a temperature in excess of 900°C for less than 4 seconds. Magnetic- 
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properties results (Table I) show that the coercive force and remanence 
are both reduced by this cycle. However, the remanence decrease is 
small in all cases and, in general, those samples that had a two-phase 
strand-annealed microstructure before aging and high temperature 
exposure show a lesser decline in coercivity than the single-phase 
strand-annealed structures. Microstructural observations (Fig. 7, 
column 2) indicate that these changes are a result of the beginning of 
the dissolution of the y (Fcc) precipitate. This cycle is sufficiently low 
in temperature and time such that none of the matrix is returned to 
Fcc structure at temperature and, hence, does not undergo the Fcc 
— Bcc transformation on cooling to markedly increase the magnetic 
hardness as observed in cycle 1. 

To clearly prove that a reversion to the strand-annealed structure 
during sealing is the cause for the magnetic property changes observed 
in reeds,? an extended-time-at-temperature cycle was produced (Fig. 3). 
The peak temperature was maintained at 1050°C, but the time in 
excess of 900°C was increased to 20 seconds. In all cases, the structure 
totally converted to a single-phase a, matrix (Fig. 7, column 3), which 
is typical of a strand anneal at too high a temperature (>950 to 
975°C).4 These structures may be compared to the initial structure of 
coil 223B in this experiment (Fig. 6, column 1), which was stated to 
be a nearly-all-a. matrix. This heat treatment produces an extremely 
skewed hysteresis loop with a coercive force of 26 to 29 oersteds and a 
greatly reduced remanence to 11 to 138 maxwells (Table I) as measured 
on l-inch samples. It is thus clear that very brief exposure to tempera- 
tures in excess of 900 to 1000°C can markedly alter the magnetic 
properties previously developed by the 600°C-aging heat treatment 
in Remendur. 


3.2 Analysis of production reeds 


The results of the before-and-after glass-to-metal sealing analysis on 
actual production reeds (Table II) correlate well with the above ob- 
servations on the simulated-sealing temperature exposures. Listed are 
the coercive force and remanence as measured on the complete reed, 
with search coil on the paddle, and also as measured on the dissociated 
paddles and shanks both before and after exposure to sealing (Experi- 
ment A). As was previously observed by others,? the major effect of 
sealing is a marked increase in coercive force on the shank. In this © 
instance, an average increase in the shank of about 6 oersteds from 
23.5 to 29.5 occurred. As expected, no change in the paddle portion 
was observed, since it does not experience the peak temperature range. 

The structural changes observed in the shanks of actual sealed reeds 
are very similar to those produced by the typical sealing cycle in the 
elevated-temperature-exposure experiments. The disappearance of the 
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Table Il— Magnetic properties of Remendur reeds before 
and after glass/metal sealing 


Experi- 


eat Samples Complete Reed— Shank Paddle 


search coil on paddle 


He (oer.) ‘ag Es He (oer.) 


Or 
(maxwells) 
610°C 

Aged reed 27.4 (3) |] 21.5 (8) | 23.6 (8) 5.9 (3 
Sealed reed | 29.7 (12) | 19.3 (12) | 29:0 (16) | 6.7 (16 


B { Accelerated 
speed seal 28.3 (11) | 21.6 (11) | 25.9 (11) 6.1 (11) 28.9 (11) } 10.3 (11) 


Sealed reed— 
norma! lamp 
power 29.5 (6) ] 21.1 (6) | 28.2 (6) 6.4 (6) 29.2 (6) | 10.3 (6) 
Sealed reed— 
reduced lamp 
power 28.0 (6) | 22.0 (6) | 24.9 (6) 6.1 (6) 29.1 (6) | 10.2 (6) 


) | 28.4 (3)] 10.0 (3) 
) | 28.7 (16) | 10.0 (16) 


(1) H applied = 300 oersteds. 
(tt) Numbers in parentheses indicate sample size. 


y (FCC) precipitate and return to the two-phase a; + a, structure or 
all-a, structure are apparent in Fig. 5. 

The results from the reeds sealed by using the accelerated speed 
sealing (Experiment B) and with reduced lamp power (Experiment C) 
show an improvement. As can be noted in Table II, the coercivity 
increase for the complete reed was smaller, and no drop in remanence 
occurred. The data on the dissociated shanks show that a coercivity 
increase still occurred on these shanks, but it was markedly smaller 
than that for the standard sealing operation. Metallographic results 
also showed that a structural change had taken place but was confined 
within a smaller percentage of the shank length. This probably ac- 
counts for the less drastically altered magnetic properties. 


IV. SUMMARY AND CONCLUSIONS 


The alteration of the magnetic properties of Remendur by the glass- 
to-metal sealing operation from those properties developed in the 
nominal 600°C-aging anneal is real. This is a consequence of excessive 
temperatures during the sealing operation causing a dissolving of the 
y (Fcc) precipitate and a reversion of the structure back to that de- 
veloped by the typical 900 to 950°C strand anneal used in processing 
the wire. With the present time/temperature sealing cycle used in 
production, the following more specific conclusions can be listed: 


(¢) From one-half to two-thirds of the shank length, beginning at 
the paddle/shank transition region, is altered by the sealing 
operation. 

(17) A significant increase in coercive force occurs in the shank. 
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(222) Laboratory results indicate that a significant drop in remanence 
also occurs during elevated temperature exposures, which are 
sufficiently high to cause a reversion to the two-phase a; + az 
or all-e, structures. 

(2v) Variations in the microstructure of 0.535-mm wire from the 
strand anneal have little effect on the subsequent structure and 
magnetic properties changes during sealing. 


A reduced-temperature sealing cycle holds some promise for minimizing 
this change, but all 2.5- to 3.0-percent vanadium Remendur alloys are 
likely to exhibit this same behavior independent of vanadium content 
or prior aging heat treatment. If this properties change is considered 
unacceptable, a modification in the sealing cycle or a redesign of the 
contact to account for this change are the most promising approaches, 
since the materials response cannot be altered. 
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A Multibeam, Spherical-Reflector Satellite 
Antenna for the 20- and 30-GHz Bands 


By R. H. TURRIN 


(Manuscript received November 18, 1974) 


A multibeam antenna for satellite communication affords reuse of fre- 
quency allocations and flexibility in traffic routing. A multibeam satellite 
antenna is described that employs a spheroidal reflector in a compact 
periscope design. The aperture 1s 1.5 meters in diameter radiating six 
beams in the 20- and 30-GHz satellite frequency bands. Electrical mea- 
surements indicate that this antenna ts suitable for multibeam satellite use. 


I. INTRODUCTION 


A proposed domestic satellite system for the 20- and 30-GHz radio 
bands includes a multibeam satellite antenna.! The use of separate 
radiating beams to service specific densely populated urban areas is 
desirable since it permits reuse of the frequency bands and flexibility 
in traffic routing. 

A multibeam satellite antenna that operates simultaneously in the 
20- and 30-GHz bands has been constructed and evaluated electrically. 
The antenna is a compact periscope design? employing a spherical re- 
flector 60 inches in diameter, a plane reflector, and a cluster of six feed 
horns. Figure 1 shows two views of the antenna system. A spherical 
rather than a parabolic reflector was employed to permit off-axis beam 
pointing with less degradation in performance than a paraboloid. The 
primary feature of this antenna design is the use of multiple feeds to 
produce separate radiation beams. The feed location format was chosen 
to conform with earth-station locations at approximately New York 
City, Denver, Atlanta, Los Angeles, Honolulu, and Puerto Rico. These 
locations are representative of the maximum beam-pointing require- 
ment for domestic satellite service and the minimum beam-pointing 
constraint owing to physical feed separation. The synchronous orbiting 
satellite was assumed to be located at 100 degrees west longitude. The 
antenna system boresight axis that corresponds to the axis of sym- 
metry of the spherical reflector is designed to intersect the earth at 38 
degrees north latitude and 100 degrees west longitude, near the geo- 
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Fig. 1—Multibeam antenna. 


graphic center of the contiguous United States. With these geo- 
graphical constraints, the maximum beam-pointing angle off boresight 
is about +6.5 degrees in the east-west direction to include Honolulu 
and Puerto Rico. 

Parameters of chief interest in the electrical evaluation of this 
antenna system are isolation between beams, absolute pointing ac- 
curacy of beams, coupling between feeds, and performance degradation 
because of feed-cluster blockage. Although these parameters may be 
evaluated analytically, the last one is especially difficult to analyze. 
For this reason and to demonstrate the others, a laboratory model was 
constructed and measurements were made. 
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Il, CONSTRUCTION 


The laboratory model was constructed to evaluate the electrical 
performance of the multibeam design and is not structurally com- 
patible with space vehicle requirements. The physical size of the aper- 
ture, dictated mainly by measurement facilities, may be considered as 
a scale model for a larger aperture. The 150-cm-diameter spherical 
reflector was fabricated at the Crawford Hill laboratory of urethane 
foam and epoxy materials and used a deep spun-aluminum paraboloid 
for basic structural integrity. The surface was machined to an rms 
accuracy of 0.05 mm using a 244-cm radial arm end mill. The plane 
reflector is a 0.64-cm-thick plate glass mirror employing the metalized 
side as a reflecting surface. The mirror is suspended with its plane 
vertical to minimize warpage resulting from gravity. Six holes are 
bored through the plate glass to admit the feeds with a minimum loss 
of surface area. The feed location format shown by Fig. 2 is on a plane 
normal to the axis of the spheroidal reflector. 
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Fig. 2—Feed format. 
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The feed horns were designed for dual-frequency operation with 
linear polarization. The polarization for the 20- and 30-GHz bands 
are orthogonal. Figure 3 shows details of the feed horn employing thin 
conductive fins that redefine the electrical aperture for the 30-GHz 
band. Since the 20-GHz polarization is normal to the fins, the feed- 
horn walls define the aperture for this band. By using the fins to 
selectively control the feed aperture size, the radiation patterns at 
both 20 and 30 GHz are more nearly the same, resulting in optimum 
illumination for the spheroidal reflector at both frequencies. Figures 4 
and 5 show feed radiation patterns at measurement frequencies of 19 
and 30.2 GHz. 

The axis of each feed was aligned parallel to the axis of the spheroidal 
reflector aperture and the feed-phase centers were arranged in a single 
plane located 120 cm from the center of the spheroidal reflector. 
Measurements on positioning of a single movable feed showed that 
there is no significant difference in radiation characteristics of the 
antenna system for the off-axis beam-pointing directions under con- 
sideration with the feed axis oriented (2) along a radial direction from 
the center of curvature of the spheroidal surface, (72) parallel to the 
axis, or (222) directed toward the center of the spheroidal reflector to 
balance the amplitude distribution. Parallel mounting of the feed axes 
greatly simplifies the feed-line structure. It was also found that arrang- 
ing the phase centers in a plane rather than in the optimum spheroidal 
focal surface produces an error in focal length of less than A/2 at 30 
GHz with little effect on the gain or radiation characteristics of the 
system. 
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Fig. 3—Linearly polarized feed horn, dual frequency (17.7-20.2 GHz and 27.5- 
30.0 GHz). 
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Fig. 4—Radiation patterns of the finned feed horn at 19.0 GHz. 


The polarization orientations of the feeds were mandated by the 
physical constraint of the feeds in New York City (feed 5) and Atlanta 
(feed 4) feeds. These two feeds were parallel-polarized and arranged 
at a skew angle with respect to the measuring plane to achieve a 
minimum physical spacing. The remaining feeds were cross-polarized 
with their adjacent neighbors for minimum electrical coupling. The 
feed locations shown in Fig. 2 were necessary to achieve the desired 
beam-pointing angles for the various earth-station locations. 


lil. MEASUREMENTS 


Electrical evaluation of the multibeam antenna was conducted at 
the Holmdel radio range. A variation was found of incident field over 
the measurement aperture of no greater than +0.75 dB for either 19 
or 30 GHz, the measurement frequencies used. 
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Fig. 5—Radiation patterns of the finned feed horn at 30.0 GHz. 


3.1 Beam pointing 


Beam-pointing accuracy is important in the design of a satellite 
system. Measurement of pointing accuracy for each of the six beams 
was accomplished by first mechanically measuring the location of each 
feed phase center relative to the axis of the spheroidal reflector (bore- 
sight axis) and in the X-Y plane (Fig. 1). Alignment of the boresight 
axis with the radio source was done by optical sighting. All measure- 
ments of beam pointing were made using synchro-type readouts with 
an accuracy of 0.05 degree. Angular measurements are in essentially 
an orthogonal coordinate system for the elevation-over-azimuth 
mount. 

Table I shows all the measured beam-pointing data, together with 
computed data based on feed location, and the pointing errors. The 
feeds are numbered as in Fig. 2. Since the half-power beamwidth of 
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Table |— Measured and computed beam-pointing errors 


Feed Freq Measured | Computed | Pointing Measured | Computed | Pointing 
No (GH 2) Ox Ox Error by Oy Error 
. (Degrees) (Degrees) (Degrees) (Degrees) (Degrees) (Degrees) 

1 19.0 — 1.95 —0.08 +6.82 —0.08 
Hon — 1.87 +6.90 

: 30.2 —1.71 +0.16 +7.28 +0.38 

2 19.0 —0.35 —0.15 +1.50 —0.11 
tA —0.20 +1.61 

me 30.2 —0.15 +0.05 +1.75 +0.14 

3 19.0 +0.20 ~0.17 —0.45 —0.14 
Den +0.37 —0.31 

: 30.2 +0.45 +0.08 —0.30 +0.01 

4 19.0 —0.58 —0.10 — 3.44 —0.19 
Atl —0.48 —3.25 

: 30.2 —0Q.40 +0.08 — 3.40 —0.15 

5 19.0 +0.35 —0.22 —4.17 —0.27 
N.Y +0.57 —3.90 

me 30.2 +0.56 —0.01 — 4.08 —0.18 

19.0 —2.19 —0.07 —6.14 ~—0.25 
6 —2.12 — 5.89 

P.R. 30.2 — 2.05 +0.07 —6.15 — 0.26 


each beam at both 19 and 30 GHz is about 0.65 degree, the maximum 
error encountered in these measurements was about one-half beam- 
width. Considering the readout accuracy, mechanical errors in the 
structure, and boresight errors, these results are remarkably con- 
sistent ; the antenna beams point in the intended directions. 


3.2 Beam coupling 


Beam coupling is a measure of the signal level in one specified beam 
when another beam is aimed toward the source. It is also a measure of 
the interference between any two particular ground stations. The 
radiation characteristics of each beam for specific off-axis angles specify 
the coupling. Measured coupling for all combinations of beams ex- 
cept the New York City-Atlanta combination was found to be less 
than —35 dB. In that case, the angular separation in the beams is 
only 1.2 degrees and the coupling is approximately —20 dB. The data 
were measured at specific incremental frequencies over the 2500-M Hz 
band at both 20 and 30 GHz. 

The beam coupling is strongly dependent on exact pointing direc- 
tions and can be greatly improved by slight misalignment of the main 
beams. It may, therefore, be an engineering expedience to suffer a 
slight degradation in gain by beam misalignment to improve the 
isolation. 


3.3 Radiation patterns 


Studies of the radiation characteristics of this antenna with a single 
horn feed were made prior to installation of the six-feed cluster. These 
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measurements were made to determine the effects of illumination on 
beams pointed off-axis. Figure 6 gives measured values of relative 
gain and first side lobes with off-axis pointing. The asymmetric il- 
lumination with off-axis pointing produces an effect similar to coma 
aberration in which the first side lobe levels become increasingly un- 
balanced. The side lobe nearest the axis of the antenna system is always 
the higher in level, as expected. 

The gain degradation for maximum off-axis pointing is a modest 1 
dB at 19 GHz. At 30 GHz, the extrapolated degradation is approxi- 
mately 2 dB. Representative measured far-field radiation patterns for 
the single feed are shown by Figs. 7 and 8. These are principal plane 
measurements, that is, patterns taken with a single feed whose polariza- 
tion orientation is parallel with the measurement plane; they show the 
typical far-side-lobe behavior of this type of antenna. 

Figures 9 and 10 are far-field radiation patterns at 19.0 GHz for 
feeds 1 and 3, when all other feeds are in place and terminated. These 
are representative of the maximum and minimum off-axis pointing. 
Figures 11 and 12 are for the same feeds at 30.2 GHz. These patterns 
illustrate the off-axis asymmetric radiation characteristics; the effect 


FIRST SIDE LOBE LEVELS 
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ANGLE OFF AXIS IN DEGREES 


Fig. 6—Single-feed measurements of off-axis performance. 


1018 THE BELL SYSTEM TECHNICAL JOURNAL, JULY-AUGUST 1975 


RELATIVE POWER ONE WAY IN DECIBELS 





ANGLE OFF AXIS IN DEGREES 


Fig. 7—Far-field radiation pattern at 30.0 GHz, single-feed centered in longitudinal 
plane (azimuth). 


is more pronounced at the higher frequency where the relative aperture 
phase errors are greater. These patterns also indicate that the small 
blockage from the feed cluster causes negligible distortion of the beams. 

The half-power beamwidths are essentially equal for both 19 and 
30 GHz; this is because the feed illumination taper is somewhat more 


SATELLITE ANTENNA ~ 1019 


RELATIVE POWER ONE WAY IN DECIBELS 





ANGLE OFF AXIS IN DEGREES 


ae 8—Far-field radiation pattern at 30.0 GHz, single-feed offset in longitudinal 
plane. 


severe at 30 GHz and because the reflector surface errors cause more 
degradation at the higher frequency. 


3.4 Gain 


Absolute gain measurements shown in Fig. 13 corroborate the 
pattern measurements; the gain falls off faster with off-axis pointing 
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Fig. 9—Feed 3, 19.0-GHz radiation patterns. 


at 30 GHz than at 19 GHz. The plotted values in Fig. 13 show the 
relative gain measured for each feed. These data are consistent with 
single feed measurements where off-axis gain degradation was some- 
what more at 30 GHz than at 19 GHz. The aperture efficiency for the 
feed nearest center, No. 3, is 54 percent at 19.0 GHz and 35.5 percent 
at 30.2 GHz. 


3.5 Reflection coefficient 


The magnitude of the reflection coefficient over each frequency band 
was measured at each feed port. The maximum value is —18 dB typi- 
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Fig. 10—Feed 1, 19.0-GHz radiation patterns. 


cally for all ports over the band 18.3 to 20.38 GHz. Figure 14 shows a 
typical reflection coefficient measurement for the 19-GHz band. The 
periodicity in the reflection coefficient indicates a mismatch at the 
feed horn to be approximately the same magnitude as the echo from 
the spheroidal reflector. The same type of periodicity is observed in the 
reflection coefficient measurement over the 27.5- to 30.2-GHz band 
as shown in Fig. 15. In this case, however, the reflection coefficient is 
much worse with a maximum value of —12 dB toward the low-fre- 
quency end of this band. 
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Fig. 11—Feed 3, 30.2-GHz radiation patterns. 


3.6 Coupling between ports 


Direct coupling between feed ports results from two causes: (2) 
reflections from the spherical reflector into other feed horns and (72) 
direct side-to-side radiation of the feed horns. Direct measurement of 
all 15 feed-coupling combinations (assuming reciprocity) was made on 
a scanned frequency basis. Three of the feeds are essentially cross- 
polarized with respect to the other three, and those nine cross couplings 
that involved the cross-polarized condition were found to have cou- 
pling levels less than —40 dB. Of the remaining six combinations of 
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Fig. 12—Feed 1, 30.2-GHz radiation patterns. 


coupling for each frequency band, the worst case is for the adjacent 
feeds 4 and 5 (New York-Atlanta), which show values of —24 dB at 
20 GHz and —26 dB in the 30-GHz band. All others show coupling 
less than —28 dB, consistent with reflection coupling from the spheri- 
cal surface alone. All unused ports were terminated during these 
measurements. 


IV. CONCLUSIONS 


Measurements on a compact spherical periscope antenna indicate 
that this design is suitable as a multibeam satellite antenna for the 
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Vig. 13—Absolute gain measured at feed port 3 by standard gain horn techniques. 
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Fig. 14—Feed port 4, reflection coefficient. 
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Pig. 15—Feed port 5, reflection coefficient. 


20- and 30-GHz bands. While the aperture diameter, 150 cm, may 
be somewhat small for practical use, most measured results are equally 
applicable to larger aperture designs. 

The only feature of the design not amenable to analysis is the block- 
age and scattering by the cluster of feeds. However, the measurements 
reported here indicate that this effect is of little consequence in the 
intended application. It is also clear that some limitations of the 
embodiment discussed here are inherent in the feed design. Minimum 
physical separation between feeds limits the minimum angular beam 
separation to about 1.2 degrees in this particular antenna, isolation 
between these beams being of the order of —20 dB. 
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Polarization-Independent, Multilayer 
Dielectrics at Oblique Incidence 
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This paper gives solutions for multilayer dielectrics that yield polariza- 
tion-independent operation at a given angle of incidence and at a given 
frequency. An exhaustive analysis is given for symmetric three-layer 
dielectrics surrounded by the same medium. Special solutions are also 
gwen for symmetric multilayer dielectrics with more than three layers 
surrounded by the same medium and for multilayer dielectrics surrounded 
by different media. Rules are presented for obtaining new polarization- 
independent solutions by cascading given solutions. Finally, for the purpose 
of demonstration and comparison of various solutions, different design 
examples are carried out for polarization-independent 3-dB beam splitters 
to operate at millimeter wavelengths. 


I. INTRODUCTION 


The state of polarization of a plane wave will, in general, be changed 
upon transmission through or reflection by a multilayer dielectric at 
oblique incidence. The reason for this depolarization is that the two 
eigenmodes of polarization, 4-field perpendicular to the plane of inci- 
dence and &-field parallel to the plane of incidence, generally have 
different reflection coefficients and different transmission coefficients. 
This is not desirable in many applications. For example, such depolari- 
zation could cause crosstalk in dual-polarization optical or quasi- 
optical components such as interference filters,’ directional couplers?~* 
and diplexers,® or mode conversion in beam-splitter-type hybrids’° 
in oversized waveguides. 

This paper provides solutions for multilayer dielectrics whose reflec- 
tion and transmission coefficients are independent of the state of 
polarization of the incident wave at a given angle of incidence and a 
given frequency. Compromising the degree of polarization indepen- 
dence to allow for a broader bandwidth of operation or for variations 
in the angle of incidence, even though desirable in many practical ap- 
plications, is not discussed. 
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The paper is primarily concerned with symmetric, multilayer di- 
electrics surrounded by the same medium. However, the case of differ- 
ent input and output media and antireflective coatings is also dis- 
cussed. Some of the results given in this paper have been previously 
reported by Kard,* Baumeister,? and Rabinovitch and Pagis. Their 
work is referred to in appropriate places in the text. 

There are other types of structures, besides the ones reported in this 
paper, that can give polarization-independent operation at oblique 
incidence. For example, artificial or natural anisotropic dielectric 
layers or metallic-wire meshes with rectangular, rather than square, 
cells can, in principle, be constructed to achieve this effect. 


Il. BASIC EQUATIONS 


Consider n adjacent isotropic dielectric layers of infinite transverse 
extent having relative dielectric constants of €1, €2, ---, én and uniform 
thicknesses of di, de, ---, dn inserted in a medium of a relative dielectric 
constant e,. Define 


Ki = €:/ €r, a=1,2,---,n (1) 


as the normalized dielectric constants of the layers. Let a uniform 
plane wave propagating in the medium be incident at an angle 6 on the 
layers as shown in Fig. 1. The incident, transmitted and reflected waves 
will have the same state of polarization if the #-field is perpendicular 





Fig. 1—n-layer dielectric inserted in a medium of a relative dielectric constant e;. 
The subscript h refers to the h-mode (#-field perpendicular to plane of incidence), 
and e refers to the e-mode (E-field parallel to plane of incidence). 
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to the plane of incidence (h-mode), or if the E-field is parallel to the 
plane of incidence (e-mode). To calculate the transmission and reflec- 
tion coefficients for the h- and e-modes, one can represent each layer 
by an equivalent transmission line."'!? The impedance of the line 
representing the zth layer normalized to that of the line representing 
the medium in which the layers are inserted is 


Zi = Zin = cos 6/(k; — sin? 6)? (2a) 
for the h-mode, and 
Zi = Lie = (x; — sin? 0)?/ (xk; cos 6) (8a) 
for the e-mode. The corresponding electrical length is 
b: = (20/d.)dive-(k; — sin? 6)? (4a) 


for both modes; \, being the free-space wavelength. If x; < sin? 4, Le., 
the total reflection condition is satisfied at the boundary of the zth 
layer, then the square root of (x; — sin? @) is imaginary and one can 
write 


Lin = JXin (2b) 
L sé as — JX se, (3b) 
oi = — jor, (4b) 


where X;n, Xie) and a; are positive, real quantities. In this case, cou- 
pling is accomplished across that layer by an evanescent wave. 
The normalized ABCD matrix corresponding to the zth layer is 


COs ¢; q sin o:Z; 
M; = . (5) 
j sin $,/Z; COS ¢; 
The overall, normalized ABCD matrix of the n layers is 
A 7B 
= MiM.---M,. (6) 
jC D 


The reason for the j in the off-diagonal terms of the matrix of eq. (6) 
is that A, B, C, and D will be real quantities if the system is lossless. 
Henceforth, this will be assumed to be the case. 

It is worth noting that reciprocity requires that 


AD + BC = 1, (7) 
and that for a symmetric structure 
A=): (8) 
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The overall field transmission coefficient, t, and field reflection co- 
efficient, r, are given by 


t= H,/E; = 2/[,A+ D+ 578+ 0)], (9) 
r=H,/E;=[(A—-—D+ j7(B-C)/[A +D4+  7(B+0)]. (0) 


A subscript of h or e should be added to all the variables in eqs. (5) 
through (10) to denote the particular mode being considered. 


Ill. CLASSIFICATION OF THE SOLUTIONS 


Our goal is to find solutions for the «’s, the d’s, and @ to achieve 
polarization-independent operation. In applications where one is 
restricted to independent h- and e-modes, or to unpolarized waves, it 
is sufficient that 

Jéx| = [te], [ral = rel. (11) 


To satisfy these conditions, eqs. (7), (9), and (10) require that 
Aj + Be+ CR + Di = AZ + BE+ C7 + Dé (12) 
For symmetric structures, eqs. (7) and (8) reduce eq. (12) to 
B,—C,r = +(B. — C.). (13) 


To preserve the state of polarization of an arbitrarily polarized 
incident wave, condition (11) is not sufficient. Instead, it 1s necessary 
that 

i, = ble, Th = fe. (14) 


The sense of polarization (i.e., the right- or left-handedness) of the 
transmitted wave will be unchanged if t, = +¢, and will be reversed 
if ¢, = —t,. On the other hand, the sense of polarization of the reflected 
wave will be unchanged if r, = —r, and will be reversed if r, = +7¢. 
The latter case is similar to reflection by a perfect, conducting plane. 

The four combinations of signs in eq. (11) are listed in Table I where 
they are referred to as Cases 1 through 4. The required relations 


Table | — Four cases for polarization-independent operation 

\ Case 1 2 3 4 

i,= te be —t, —t, 
T, = ts —Te T. —Te 
A,= Ae D. —A, —D,. 
B,= B, C, —B, —C;, 
Crh= C; B, —C. —B, 
D,= D. A. —D, —Le 
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between A;, Br, Ci, Dn, and A., B., C., D. to satisfy each of these cases 
can be deduced from eqs. (9) and (10) and are also given in Table I. 

In addition to classifying the solutions in the manner of Table I, 
another classification based on the numerical values of the x’s is useful. 
To be concise, let kmin be the smallest value of the x’s and let it be at 
the mth layer, 1.e., 


Kmin = Min (kj, Ka, ***, Kn) = Km. (15) 


If kmin 2 1, then none of the «’s is smaller than unity and the solution 
can be realized by dielectric layers, e; 2 1, inserted in air, e- = 1. This 
will be referred to as a Type A solution. On the other hand, if kmin < 1, 
the surrounding medium can no longer be air since this would require 
that at least em be less than unity (recall that m is the layer with the 
smallest value of x); thus, a dielectric-prism realization as shown in 
Fig. 2 is necessary. In this case, the mth layer can be an air separation 
ém = 1, and thus, from eq. (1) 


é¢ = ki/Kmin 2 I, é> = 1/kmin > 1. (16) 


The prism realization will be referred to as a Type B solution when 
1 > kmin 2 Sin? 0, and a Type C solution when kmin < sin? 6. The dis- 


— 
















7 
<= 
Sa a a eS ae ee ee 








_Fig. 2—A prism realization having the mth layer (the layer with the smallest 
dielectric constant) made of an air separation. Note that km = kmin < 1. Matching 
layers are required at the external surfaces of the prisms. 
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Table !|— The three types of solutions 


Type A B C 
Condition ony | sin? 6 S kmin <1 | kmin < sin? 6 
Description Dielectric slabs in Prism realization | Prism realization 

air (e, = 1) is (e, > 1) is (er > 1) is neces- 

possible. necessary. No sary. Evanescent 
evanescent waves exist. 
waves. 


tinction is made because in a Type C solution, the condition for total 
reflection is satisfied at the boundary of the mth layer and coupling 
across that layer is accomplished by an evanescent wave, aS mentioned 
in Section II. This is not the case for a Type B solution. Table II 
summarizes the conditions for the three types of solution. 

Dielectric-prism devices have been used for many optical and 
quasi-optical?—>:5—18 applications. When the total reflection condition 1s 
satisfied, Type C realization, the device is often said to be of the frus- 
trated-total-reflection type. 

Fresnel reflections at the outer surfaces of the prisms might cause 
undesirable effects.!? However, this can be avoided by matching each 
of these surfaces by the familiar \/4 transformer, or by other 
means.?:?°-2 In any case, for the purpose of this paper, the method of 
matching should be independent of the state of polarization of the 
incident wave. Thus, Brewster angle matching!”)!§ would not be suit- 
able here. 

In the remainder of the paper, a combined reference to the four 
cases of Table I and the three types of Table II is adopted. For ex- 
ample, Case 1A refers to Case 1 of Table I and Type A of Table II. 


IV. SINGLE-LAYER SOLUTIONS 


It can be shown by using the procedure outlined in Section II that 
no solution exists for a single layer to satisfy any of the four cases 
indicated in Table I. However, it is possible to satisfy the weaker con- 
dition given in eq. (11) by requiring that 


Ky = &,/e, = sin? 6/(1 + cos? 8), (17) 


where the symbols are defined in Fig. 1 with n = 1. Substituting eq. 
(12) into eqs. (2) through (10), one obtains 


th = te, He =F, (18) 


where the asterisk denotes complex conjugation. 
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It is clear from eq. (17) that x1 < sin? 6 and, hence, from Table II, 
the solution 1s of Type C, and a prism realization is necessary. 

It is interesting to note that condition (17) is independent of the 
thickness, di, of the layer. Thus, varying d; will change the reflection 
and transmission coefficients without affecting the polarization inde- 
pendence. If R = |r|? is the required power reflection coefficient, then 
eqs. (17) and (2) through (10), with n = 1, give 


dy = [do/(24rVex) JD + «1)/(1 — «) F* 
X sinh! {2[kiR/(1 — R)}/(1 + x1}. (19) 


A continuous variation of di, and hence of R, can be accomplished 
most conveniently if the layer is simply an air separation, e: = 1. In 
this case, if the angle of incidence is @ = 45°, which is convenient for 
mechanical considerations, eq. (17) requires the dielectric constant 
of the prisms to be e, = 8. On the other hand, if the prisms were made 
of polystyrene, e¢, = 2.54, eq. (17) requires that @ = 48.73°. 


V. SYMMETRIC THREE-LAYER SOLUTIONS 


It was mentioned in the previous section that no single-layer solution 
exists that satisfies any of the four cases of Table I. The same is also 
true for two layers. On the other hand, one can satisfy any of the four 
cases with three or more layers. In this section, we give all the solutions 
for symmetric three-layer structures, Le., 


K3 = Ki, ds = dy. (20) 


Even though unsymmetric three-layer structures can have polariza- 
tion-independent solutions, they will not be discussed here because of 
the algebraic complexities involved. 

Using the terminology of Section II, it can be shown that the 
normalized ABCD parameters of a symmetric three-layer structure 
are given by 


A = D = cos ¢2 cos (2¢1) — $[Z1/Z2 + Ze/Z1 | sin ¢2 sin (241) (21) 


B = Z, cos $2 sin (2¢1) + Zs sin 2 Cos’ g1 
— [47/Z2] sin d2 sin’ di (22) 


C = Z;' cos ¢@2 sin (261) + Zz' sin 2 cos’ 1 
— [Z.,/Z7] sin ¢2 sin? di. (23) 


Combining these equations with eqs. (2) through (4), (9), and (10), 
one can obtain solutions for the cases listed in Tables I and II. In 
Cases 3 and 4 of Table IJ, this process involves considerable algebraic 
manipulation. The results are given below. For simplicity, expressions 
for ¢1 (or ai) and @2 (or az) are given instead of expressions for d; and 
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d». Also, we define 


Pi (kK; = sin? 6)?, = l, 2 


Il 


qi = (sin? 6 — x,)}, i he” 
f(x,y) = 2xy — (a + y) sin? 4, 

R= |ril? = [rel’, 

S = [R/(1 — R)}}, 

U = (2VR/(1 + VR)]}, 

V = (1 — VR)/( + VR). 


(24a) 
(24b) 
(25) 
(26) 
(27) 
(28) 
(29) 


In each case, necessary upper and/or lower bounds on «1, ko, 0, and R 
are given. Because of the algebraic complexity involved, the bounds on 
k, and xo in Cases 3 and 4 are looser than those given in Cases 1 and 2. 


Case 1A: 


di = (2m + 1)7/2, m =0,1,2,---. 
ky = «i/[1 — («1 — 1)?/cos? 6]. 
sin @: = +28/[Vko/K1 — K1/Vko]. 
1+ Ucos@S«1<1-+ cos @. 
(1+ Ucos 0)2?/V S kz < o. 
This solution exists for all R, 6. 
Case 1B: 
$1, Kz, @2 are the same as in eqs. (30) through (382). 
sin? 6 < «1 S$ 1— Ucos8@. 


sin? @ << xe S (1 — U cos @)?/V. 


1 < «i/k2 S max [2/(1 + sin 6), V/(1 — U cos 8) }. 


This solution exists if and only if 
sin @ < V2, 


or, equivalently, 
R < [cos? 6/(1 + sin 26) ]?. 


Case 1C: 


No solution exists. 


(30) 
(31) 
(32) 
(33) 
(34) 


(35) 
(36) 
(37) 


(38a) 


(38b) 
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Case 2A: 


o, = (2m + 1)7/2, m=0,1,2,-+-. (39) 

Ke = «3. (40) 
sin d2 = +28 cos Opipe/[ (ki — 1) sin 6 |’. (41) 
Ki >Kka (>1), (42) 

ke >«2 (>1), (43) 


where x, is the root larger than unity of the quartic equation 


(W — 1)x* + 2(2 — W sin? 6)x® — (6 + W sin? 6 cos? @)x? 
+ 2(2 + W sin‘ @)x — (1 + Wsin* #) = 0, (44) 


where 
W = [2S cos 0/sin? 6 /?. (45) 
This solution exists if and only if 
sin é@ > U, (46a) 
or, equivalently, 
R < [sin? 6/(1 + cos? 6) J. (46b) 
Case 2B: 
$1, Ke, @2 are the same as in eqs. (39) through (41). 
sn@<«kSx«, (<1), (47) 
sin? 9 < kp Sx, (<1), (48) 


where x» 1s the positive root less than unity of the quartic equation (44). 
This solution exists for all R, 6. 


Case 2C: 
$1, kz are the same as in eqs. (39) and (40). 
sinh a2 = 2S cos 6p%qe/[ (1 — x1) sin @ P. (49) 
sin? @ < xi < sin @. (50) 
sint 9 < ke < sin? @. (51) 


This solution exists for all R, 0. It has been previously found by 
Kard.8 


Cases 3A, 3B: 


No solutions exist. 
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Case 8C: 
cos (261) = f(ki, 1) f(k1, K2)/L(k1 — 1) (ki — kz) sint 6]. 


(Lee K1) (ky — Ke) sin? 6, Y — (ko + X?*) cos? 6 |? 


S = Vkei(k2 — X) cos 6/[Y — (ko + X?) cos? a}, 


sinh ay = 


where 
X = f(k1, «2)/[(1 + #1) sin? 6], 
Y = ko(1 — xe) (ky — 1)?/(1 +p). 
Ki > {1 +38 cos? 6 + [8 cos? 6(1 + cos? 6) ]?}/sin? 6. 
sin? 6 cos? 6/(1 + cos? 6) S xz S sin? 6/(1 + cos? 8). 


This solution exists if and only if 


sin 6 > (2/8)H[(1 + 32/2782)? + 1} 


— [1 + 32/278?)? — 1 }*}3, 


or, equivalently, 
R > 8 cos? 6/(8 cos? 6 + sin® 6). 


Cases 4A, 4B: 
No solutions exist. 


Case 4C: 
cosh 2a, = (1 + x1) (ke + x1)/E(1 — £1) (k2 — £1) J. 


QV ix ko(1 + Kx) QD 
F(k1, Ke) DCL + ke) (KT A ke) |? 


S = Vei(ke — X)/[cos Oq1(k2 + X?)*), 


tan do = 


where X is given in eq. (55). 
kK, < sin? @. 
Ko > sin? @. 
This solution exists for all R, 6. 


Discussion: 


(52) 
(53) 


(54) 


(55) 
(56) 
(57) 
(58) 


(59a) 


(59b) 


(60) 


(61) 


(62) 


(63) 
(64) 


The above exhausts all possible solutions for a symmetric three-layer 
dielectric with no polarization dependence. The choice of the par- 
ticular solution to implement in a given problem depends on the desired 
values of R and 6, the available dielectric materials, and whether 
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having dielectric layers surrounded by air (Type A solution) or a 
prism realization (Type B or C solution) is more suitable. 

Cases 1A and 2A, which are the only possible Type A solutions, 
require large values for the dielectric constants and/or the angle of 
incidence to achieve moderate or large values of R. This point is 
illustrated for R = 0.5 in the design examples given in Section VIII. 

In Case 2, the required relation between x; and xz given in eq. (40) 
has an advantage for the prism realizations in Cases 2B and 2C. In 
these two cases, since kz < x; < 1, eqs. (16) and (40) give e, = 1, 
€. = K1/K2, and e, = 1/x. = ej. Thus, e; can also be used as a quarter- 
wave transformer to match the outer surfaces of the prisms. 

As in the single-layer prism solution given in Section IV, one can 
vary Ff in Cases 1 and 2 without affecting the polarization independence 
by simply changing d.. This is particularly convenient in the prism 
realizations in Cases 1B, 2B, and 2C with e, = 1 (air) since, in this 
case, d, can be changed continuously. Case 2C is unique in that one 
can obtain any value of R from zero to unity by varying d, from zero 
to infinity. In Cases 1A, 1B, 2A, and 2B, varying d, results in a periodic 
variation of R between zero and a maximum value less than unity. 
Varying d; or dz in Cases 3 or 4 results in a polarization-dependent 
operation. 

Cases 1A, 2B, 2C, and 4C have solutions for all R and @. This is not 
true for the remaining three cases, 1B, 2A, and 3C, as indicated by eqs. 
(38), (46), and (59). However, one should note that the numerical 
values of €,-, €1, and e:, and not necessarily the limitations on # and 86, 
often determine whether or not any particular case can be used in 
practice. 

In Case 4C, eq. (46) indicates that one can have x2 = 1. In this case, 
given that x, <1 from eq. (63), eq. (16) gives eg = 1 and «6 = & 
= 1/x,. Thus, the prism solution can be realized by using one dielectric 
material with air separations. This is not true for any of the other cases. 


VI. SYMMETRIC MULTILAYER SOLUTIONS 


When there are more than three layers, an exhaustive analysis 
similar to that given in the previous section becomes quite involved. 
However, two relatively simple classes of solutions for a symmetric 
multilayer dielectric with no polarization dependence can be obtained 
by generalizing Cases 1 and 2 of the symmetric three-layer dielectric. 
First, one should note that symmetry requires that the number of 
layers. n be odd, Le., 


peop. Pee. tee (65) 
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Thus, from symmetry 
Ki = Knyri, dy = dni, 4=1,2,---,0. (66) 
Let us assume that the electrical lengths of the first, and thus also 
the last, J layers are odd multiples of 7/2; i.e., 
d: = (2m; + 1)7/2; m, = 0,1,2,---, 7=1,2,---,l. (67) 


While this is a necessary condition to obtain solutions for Cases 1 and 2 
of a symmetric three-layer dielectric, as indicated in the previous 
section, eqs. (30) and (89), it is not necessary in general. However, 
this assumption is employed here because it simplifies the analysis 


considerably. 
Define the function g; of the vector x = {21, te, +++, Ti41} as 
g(x) = ah?l( TT D/C IL 2%. (68) 
todd Sl teven Sl 
Thus, gi(xX) = 27/22, go(x) = x?a3/25, ---, etc. Hach x; can assume the 


value of Zin, Zie, Ki, OF vi; (which will be defined later). From eqs. (66) 
and (67) and Section II, it can be shown that the normalized ABCD 
parameters of the symmetric multilayer dielectric are given by 


A = D = (—1)' cos ($141), (69) 
B= (—1)' sin (br41)9:(Z), (70) 
C = (—1)' sin (6141)/91(Z). (71) 


Since #741 is the same for both modes, eq. (69) shows that A, = D, 
= A. = D.. Thus, from Table I, the above equations can give solutions 
for Cases 1 and 2 provided that the conditions for the B’s and the C’s 
are met. Before satisfying these conditions, however, we note from eqs. 
(2) and (8) that 

LinLie = 1/«; (72) 
and define 
Vv, = Lie/ Lin = (Ki — sin? 6) / (Kk; cos? 0). (73) 


As x; assumes the increasing values of zero, sin? 6, 1, and + ~, then »; 
assumes the increasing values of — «, 0, 1, and sec? 6, respectively. 
From eqs. (9), (10), and (65) through (73), and from Table I, one 
can obtain solutions for Cases 1 (B, = B., C, = C.), and 2 (B, = C., 
C, = B,.). The results are given below. Because of algebraic complexity, 
each case has not been subdivided into Type A, B, and C solutions. 
Also, for the same reason, no bounds are given on the x’s, 6, or R. 


Case 1: 
gi(v) = 1. (74) 


sin di41 = +28/{Lgi(«) }*? — Lgr(«) 4}. (75) 
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Case 2: 
gi(n) = 1. (76) 


sin di41 = +2S/{Lgi(v)]? — Lgr(v) 3}. (77) 


In eqs. (75) and (77), S is the same function of R defined in eq. (27). 
Note that the roles of the «’s and the v’s are interchanged in the two 
cases. 

For a solution of any particular case to be valid, the value of sin ¢141 
should either be real with a magnitude not exceeding unity, which 
leads to a Type A or a Type B solution, or be imaginary, which leads 
to a Type C solution. The latter solution is not possible in Case 1 be- 
cause the x’s are real positive quantities, and thus the right-hand side 
of eq. (75) is always real. On the other hand, since the »v’s, and in par- 
ticular vz,1, can be negative, a Type C solution is possible in Case 2. 

As in the previous section, solutions for Cases 1A, 2B, and 2C in the 
present section are possible for all values of R and @. 

In Cases 2B and 2C, when x;,1 does not exceed «1, ko, +++, Ki, Or Unity, 
eq. (16) shows that one can have a prism realization with e141 = 1, 
€. = kKi/ Kina, ++ °,) €1 = K1/Ki41, and e, = 1/x141. Using eq. (76), one can 
show that / quarter-wave layers with dielectric constants €1, €2, ---, € 
will match the outer surfaces of the prisms. 

In practice, it is desirable to construct the multilayer dielectric 
reflector with the least number of different dielectrics. Thus, let us 
assume that.two dielectrics with normalized dielectric constants x; and 
ke are used alternatively ; 1.e., 


Kj = Ki, 7 odd, (78a) 
Ky = Ko, even. (78b) 


Combining this assumption with those in eqs. (65) through (67), the 
general solutions given in eqs. (74) through (77) simplify considerably. 
Because of its practical importance, we restrict ourselves to Type A 
solution, where «x; and x2 are larger than unity. No solution exists if 
either x: or Ks 18 equal to unity. 


Case 1A: 
ky = sin? 6/{1 — [(x1 — sin? 6)/x, 4!" (cos 02/44. (79) 
sin @iyy1 = 22S8/[(«2/«gb1)? — (wth t/nh)?]. (80) 
ky < sin? 6/[1 — (cos 6)?/(+) ], (81) 
K/h < Vi (82) 


This solution exists for all R, 6. 
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Case 2A: 


ee «a (83) 
sin @iy1 = 428/[(vih7/05)* — (oh/o47 4) F]. (84) 
v/v} < Vz (85) 
This solution exists 1f and only if 
siné@ > U, (86a) 
or, equivalently, 
R < [sin? 6/(1 + cos? 6) }?. (86b) 


The quantities S, U, and V used in the above equations are defined 
in eqs. (27) through (29). Of course, when / = 1, 1.e., n = 3, the above 
solutions reduce to Cases 1A and 2A given in the previous section for 
symmetric three-layer dielectric. 


Vil. CASCADING SOLUTIONS 


It may happen that some required values of the reflection coefficient 
cannot be obtained by any of the solutions given in the previous two 
sections because of limitations on the values of the dielectric constant 
available in practice, or because the resulting angle of incidence 1s not 
sultable for a particular application. In this case, the desired multi- 
layer dielectric may be obtained by cascading two or more solutions, 
provided that they have the same e, and the same 86. 

We now proceed to find the rules for proper cascading to maintain 
polarization-independent operation. Let the two solutions to be cas- 
caded be labeled x and y. They are not necessarily symmetric or 
identical. Let ¢ = 27d cos @/X be the electric length between x and y 
(see Fig. 3). Dropping the subscripts h and e, which denote the par- 
ticular mode being considered, we define rz, ’z, tz, Ty, Ty, and ty as the 
reflection and transmission coefficients of x and y in the directions 
specified in the figure. The transmission coefficients ft, and t, do not 
depend on the direction of propagation because of reciprocity and 
because the medium is the same on both sides of each of x and y. 
Because of losslessness, the difference between r, and r, or between 
ry and r, is a phase factor. Of course, if x is symmetric then r, = rz, and 
if y is symmetric then r, = 1,. 

Using the method of successive reflections,”’ it can be shown that for 
a wave incident on the x side of the cascade, the overall field trans- 
mission coefficient, ¢, and field reflection coefficient, r, are given by 


t = tty exp (— jo)/L1 — rary exp (— j2¢) ], (87) 
r=r, + tary exp (—j2¢)/[1 — rar, exp (— j29)], (88) 


1040 THE BELL SYSTEM TECHNICAL JOURNAL, JULY-AUGUST 1975 


Assit Ay 
.eneeen seuss’ r Attithttty 
AT y A 
,easeseuunan: sepeonsenae 
AT I —e—e—e—t a cm e 
,asscecccena’ Ant thy 
tet S| eee + ee ceaeuenes 
6” AS t Pestestiisats 
ASS Ay t 
Ay : Ty y 
Pay 8 Ay 
A TTISITI Titi, 5 . 
ATTHITE THI Attttttttry 
roseceseses °64 Atti ttttty 
Attrtttttt yy Pmrtttttity 
,OSeeeueuseur Sa Seoaene 
-S88eeeceuucggv t seence ‘esce t 
; rsaseesserenes x At a 
x ; ty 
yaseususucest AD 
.seuscsososer /Gauseseuecs: 
Gescsesecses Atty 
ATTTITITIL Ie 0 0ti‘“‘i‘iCOS EI 
-seueeesnsea’ O9esesuaaae 
' 2eussaeeuese’ susesesacas 
r sgesiessstecg ersetastic 
ATitreTitiit yp r Og Tt 
Atrrag crit x mm A, ans 
a uses’ Airitt “ty 
oo, ease o 
H ‘\ 
\ y 
d ‘ 
\ 
A 


Fig. 3—Cascading of two polarization-independent solutions z and y to obtain an 
overall polarization-independent solution. 


where we recall that a subscript of A or e has been dropped from all the 
t’s and r’s. 

Since x and y are polarization-independent solutions, then, from eq. 
(14) or Table I, reary, = 1eelye. Only the positive sign is acceptable, 
otherwise the magnitude of the denominators in eqs. (87) and (88) 
would be different for the h- and e-modes, which will result in a polari- 
zation-dependent operation for the cascade. Thus, the sign of r./r;, 
should be the same for all the cascaded sections; 1.e., one can only have 
a combination of Cases 1 and 3 or Cases 2 and 4. 

Let the abbreviation 7*7 = k mean: ‘‘Cascading Cases 2 and 7 gives 
Case k.”’ It can be shown from eqs. (87) and (88) and Table I that 


Llxl = 3%38 = 1, 143 = 3x1 = 8, (89a) 

2x2 = 444 = 2, 24 = 4%2 = 4, (89b) 

Let the required overall power reflection coefficient of the cascade be 
R= [rl (90) 


and let it be realized by two identical cascaded sections each having a 
power reflection coefficient R,. Then, one can write 


Tr = Ty = Rie’*:, (91) 
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where ¢, 1s the phase angle of r,. This can be found given the e’s, d’s, 
and 6 by using eqs. (1) through (10). Substituting eq. (91) in eqs. (87) 
and (88), and defining 


v = ¢: — , (92) 

one obtains 
R=([1+ (1 — R,)*/(4R, sin? y) |. (93) 
Thus, for a given value of R,, one can obtain any value of F satisfying 
OS Rk $4k./(1 + R,)*. (94) 


Note that the upper bound on R is larger than R,. Conversely, for a 
given value of R, one can use any value of FR, satisfying 


[1 — (1—R)'P/R < R. S11. (95) 
The higher the value used for FR, the smaller is the overall bandwidth. 


Vil. DIFFERENT INPUT AND OUTPUT MEDIA, AND ANTIREFLECTION 
COATINGS 
Often, especially at optical wavelengths, the multilayer dielectric is 
supported by a substrate having a relative dielectric constant e, which 
is, in general, different from that of the input medium, e,. In this case, 
general analysis of polarization-independent solutions is complicated. 
However, it can be shown that special solutions similar to those for the 
symmetric (21 + 1)-layer dielectric given in Section VI can be easily 
obtained. For this purpose, let 6, as before, be the angle of incidence 
in the e, medium. Assume that there are /1(21) layers with relative 
dielectric constants €1, €2, ---, €: between the e, and e, media. The 
normalized dielectric constants 1, k2, -++, Kk, are defined by eq. (1). 
In addition, we define 
Kiz1 = €3/€r (96) 


Further, as was done in Section VI, assume that each of the / layers is 
effectively a quarter-wave thick at the given angle of incidence, i.e., 


¢; = (2m; + 1)7/2; m,; = 0,1, 2,---, 4=1,2,---,l. (97) 


With the above definitions and assumptions, it can be shown from 
eqs. (1) through (6) that the normalized input impedance as seen from 
the «, medium is given by 


Zin a gi(Z), (98) 


where Z = {Z,, Zs, ---, Zi41} and the function g; is defined in eq. (68). 
Thus, the reflection coefficient in the e, medium is given by 


f= (Zin —1)/ Gat 1). (99) 
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A subscript of A or e, depending on the particular mode being con- 
sidered, has been dropped from all the variables in eqs. (98) and (99). 
It is clear that if ki41 = €,/e, > sin? 6, then Zi41, Zin and r are all 
real quantities. In this case, it can be shown that the reflection coeffi- 
cient r, in the e, medium, and the transmission coefficient t across the 

l layers, are given by 
t= (a1) (100) 


and 


ep awe (101) 


where the m,’s are the integers defined in eq. (97). Since the impedances 
of the input and output media are different, ¢ is defined here as the 
scattering transmission coefficient, which is usually referred to as Sy». 
Using the same notation, r and r, could be referred to as Si: and Se», 
respectively. 

The coefficients r, and ¢ can not be defined if the total reflection con- 
dition, ki41 = €./e, < sin’ 9, is satisfied. In this case, Z)4.1 is imaginary 
and a nonpropagating evanescent wave exists in the e, medium. It 
also follows that Z;, is imaginary, and hence, from eq. (99), |r| = 1. 

It can be seen from eq. (99) that for re = 7, (Case 1), Zin,e = Zin.hj 
and that for re = —r, (Case 2), Zin,e = 1/Zin,n. Thus, combining eqs. 
(98), (68), (72), and (73), and defining R = |r.|? = |r,|? as the power 
reflection coefficient, one obtains the following solutions. 


Case 1: 
gi(v) = 1. (102) 
R= {[gi(x) ]* — 1}?/t Lge)? + 1Y?. (103) 

Case 2: 
gi(*x) = 1. (104) 
R= {Loi(v) Jt — 1}?/{Lor(v) J? + 1}. (105) 


The above solution of Case 2 has been previously obtained by 
Baumeister? and by Rabinovitch and Pagis.'° Both papers, however, 
overlooked the solution of Case 1. 

It is observed that eqs. (102) and (104) are identical to eqs. (74) and 
(76) for the symmetric (21 + 1)-layer dielectric. This should not be 
surprising since the latter can be obtained by cascading, back to back, 
two identical solutions of the type described in the present section. 
In fact, this is the reason that the conditions for polarization-inde- 
pendent operation given in Sections V and VI for Cases 1 and 2 are 
independent of the thickness of the middle layer. 
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It is clear from eqs. (102) through (105) that if 
gi(x) = gi(v) = 1, (106) 


one obtains R = 0. Thus, eq. (106) gives the conditions for polariza- 
tion-independent antireflection coatings at inclined incidence. It has 
been previously reported by Bauneister.? Rabinovitch and Pagis! 
have incorrectly stated that no such solution exists. It 1s interesting 
to note that the condition g;(x) = 1, which is independent of the angle 
of incidence, is the same condition on the dielectric constants that 
yields an antireflection coating at normal incidence. 

At least two layers, 1 2 2, are required to satisfy eq. (106). For 
| = 2, e, = 1 and for a given finite @, eq. (106) gives the unique 
solution. 

e = [ed + sin? 6+ (e, — sin? 6)? cos 6]/(e68 +1), (107) 


€2 = €,€}. (107b) 


For 1 > 2, the solution of eq. (106) is not unique, and we have extra 
degrees of freedom in choosing the dielectric constants of the layers. 
However, as pointed out by Baumeister,? the values of dielectric con- 
stants available in practice will often force a compromise of the 
effectiveness of the antireflection coatings. 


IX. DESIGN EXAMPLES 


For the purpose of demonstration and comparison of various solu- 
tions, let us design polarization-independent, multilayer, dielectric 
beam splitters with a power reflection coefficient R = 0.5 at 50 GHz, 
1.e., 4 = 6 mm. (The frequency is relevant only for the selection of 
the dielectric materials.) Such 3-dB beam splitters may be used in 
circular-waveguide, channel-dropping filters.’ 

All the solutions considered are symmetric, and all, with the excep- 
tion of the last solution, are of Type A with air as the surrounding 
medium; i.e., 

e- = I, 


and thus, from eq. (1) and Table II, it follows that 
Ki = « 2 l, 7=1,2,-°°,n. 


In three of the solutions, the frequency response of |7,|? and |r./?, in 
dB, and the phase difference 


Ad = phase(r,) — phase(r.) (mod 180°) 
= phase(t,) — phase (¢.) (mod 180°) 


are given. The second equality follows from losslessness and symmetry 
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Table !I!—- Some low-loss dielectrics at millimeter wavelengths 


. : Frequency 
Dielectric eras Loss tangent range of Reference No. 
Perey measurements 
Kearfott 
High-Purity 9.4 0.00017 14 to 50 GHz 24 
Alumina 
Polystyrene 2.54 0.0012 10 to 25 GHz 25 
Teflon a | 0.0005 50 to 70 GHz | Unpublished 
report 
Eccofoam PS" | 1.02 to 2.0 0.0004 10 GHz 26 


“This is a CO. foamed polystyrene whose density can be adjusted to yield any 
dielectric constant in the range specified. It is not clear how the Rayleigh scattering 
by the CO, bubbles will affect the loss tangent at frequencies higher than 10 GHz. 


since, under these conditions, the difference between the phases of 
r and ¢ 1s +90° at all frequencies as can be deduced from eqs. (8) 
through (10). 

The dielectric materials employed in our solutions will be chosen 
from those given in Table III. Because of their low loss, these materials 
are suitable for use at millimeter wavelengths. Other suitable dielec- 
trics can be found in Refs. 24 and 25. 


9.1 Symmetric three-layer, Case 1A solution (e1, 2, €:) 


With R = 0.5, eqs. (33) and (84) require that 1.64 S e, < 1.71 and 
e. 2 15.74 for 0 = 45°; 146 S$ a < 1.5 and 2 2 12.34 for 6 = 60°; 
or 1.23 Se < 1.26 and e, = 8.90 for 6 = 75°. It is difficult to sealine 
a solution at millimeter wavelengths for @ less than about 60° because 
of the high required value e2. However, for 6 75°, the value of e2 can 
be realized by Kearfott Alumina (e = 9.4). The corresponding low 
value of e; can be obtained by using Eccofoam PS, or foamed rubber 
such as Buna S Rubber whose dielectric constant can be extrapolated 
to be 1.26 at 50 GHz. Thus, with ey = 1.26 and e. = 9.4, eq. (31) 
gives 9 = 73.43°, eqs. (4a) and (80) give d1/A, = 0.428 + m1 X 0.856, 
and eqs. (4a) and (81) give do/A. = 0.078 + me. X 0.172, where m1 
and me. are integers. 

The frequency response of this solution is given in Fig. 4 for my = my» 
= 0. The large required value of @ makes this solution undesirable. 


9.2 Symmetric three-layer, Case 2A solution (e1, e, €:) 


With Ff = 0.5, eq. (46) requires that 6 > 65.53°; and eqs. (40) and 
(42) through (44) require that «, > 4.61 and e; = ef > 21.23 for 
6 = 70°; e > 2.09 and e. = ef > 4.388 for 6 = 75°; and e, > 1.58 
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Fig. 4—Frequency response of the solution in 9.1 [three-layer, Type 1A solution 
with 6 = 73.43°, e:=1.26 (Buna S Rubber), e2=9.4 (Kearfott Alumina), di/Ao = 0.428, 
and d2/d, = 0.078 ]. 


and e. = ej > 2.49 for 6 = 78°. In the last case, the solution can be 
realized by Eccofoam PS (e = 1.59) and polystyrene (« = 2.54). Just 
as in the solution in Section 9.1, the required large value of 6 makes this 
solution undesirable. For symmetric multilayer, Type-2A solutions with 
more than three layers of two dielectrics, eq. (86) shows that @ is still 
required to exceed 65.53°. 


9.3 Symmetric five-layer, Case 1A solution (e, €2, €:, e, €:) 


For R = 0.5 andl = 2,1.e., n = five layers, eqs. (79), (80), and (81) 
can be satisfied with e = e3 = 2.1 (Teflon), e = 9.4 (Kearfott 
Alumina), and 6 = 46.91°, which is a convenient angle of incidence. 
Further, eqs. (4a) and (67) give di/d\. = 0.200 + m, X 0.3899 and 
do/\. = 0.084 + mz X 0.168, and eqs. (4a) and (80) give d3/)., 
= 0.103 + ms X 0.399, where 1, m2, and ms; are integers. The fre- 
quency response of this solution is given in Fig. 5 for my = my. = 
m3 = 0. 


9.4 Symmetric seven-layer, Case 1A solution (e1, 2, €3 €4) €35 €2 €1) 


For R = 0.5 and | = 3, 1.e., n = seven layers, eqs. (79), (80), and 
(81) can be satisfied with e: = e; = 2.54 (polystyrene), e. = e4 = 9.4 
(Kearfott Alumina), and @ = 47.52°. Further, eqs. (4a) and (67) give 
dy/Xo = d3/No = 0.177 + m, X 0.354 and ds/d, = 0.084 + me X 0.168, 
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and eqs. (4a) and (80) give ds/A. = 0.026 + m4 X 0.168, where mu, 
Mm, and m, are integers. This solution has a narrower bandwidth than 
the solution in 9.3. 


9.5 Symmetric seven-layer, Case 1A solution obtained by cascading two 
identical, symmetric three-layer, Case 1A solutions 
(e15 €2, €1, 1.0, €1, €2, €1) 

With R = 0.5, eq. (95) shows that R., the power reflection coefh- 
cient of each cascaded three-layer section, should not be less than 
0.1716; thus let us choose R, = 0.1716. In this case, eqs. (381), (33), 
and (84) can be satisfied with e, = 1.6 (Eccofoam PS), e = 9.4 
(Kearfott Alumina), and @ = 45.30°. Further, eqs. (4a) and (80) give 
di/X. = 0.239 + m, X 0.478 and eqs. (4a) and (82) give d2/dA. = 0.038 

To find the width d of the air separation, one should first find the 
phase ¢, of the reflection coefficient of each cascaded three-layer 
section. By substituting the values of €1, €2, di, dz, and @ obtained above 
in eqs. (2) through (10), one finds that ¢, = 43.59°. With R = 0.5 
and Rz = 0.1716, eq. (93) gives y = +90° (mod 180°), and hence, 
the electrical length of the air spacing between the two three-layer 
sections is found from eq. (92) to be ¢ = 133.59° + m X 180°, where 
m is an integer. Thus, from eq. (4a) with e, = x; = 1, d/rA, = 0.528 
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Fig. 5—Frequency response of the solution in 9.3 [five-layer, Type 1A solution with 
6= 46.91°, e = 6 = 2.1 (Teflon), «e = 9.4 (Kearfott Alumina), d:/A, = 0.200, 
d2/. = 0.084, and d3/A, = 0.103 ]. 
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Fig. 6—Frequency response of the solution in 9.6 [three-layer, Type 2C prism solu- 
tion with 6=45°, e,=2.54 (polystyrene), e: =1.59 (Eccofoam PS), e=1, di/Ao= 0.439, 
and d2/d\, = 0.236 ]. 


+m X 0.711. This solution has a narrower bandwidth than the solu- 
tion in 9.3. 


9.6 Symmetric three-layer, Case 2C prism solution (e,, 4, 1.0, e, er) 


Choosing 6 = 45° and es = 1.0, eqs. (16), (40), (50) and (51) give 
1.414 < e <2 and 2 < «e, = e < 4. Thus, the solution can be ob- 
tained with e; = 1.59 (Eccofoam PS) and e, = & = 2.54 (polysty- 
rene). Further, eqs. (4a) and (39) give di/\. = 0.489 + m X 0.879, 
where m is an integer, and with R = 0.5, eqs. (4a), (4b), and (49) 
give d2/\, = 0.236. The frequency response of this solution is given 
in Fig. 6. for m = O and under the assumption that the outer surfaces 
of the prisms are perfectly matched. As mentioned at the end of Sec- 
tion V, the matching can be accomplished by using a quarter-wave 
layer having a dielectric constant e, = 1.59 (Eccofoam PS). 
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An engineering method ts proposed for calculating rain attenuation 
distributions for frequencies greater than 10 GHz and for paths of arbitrary 
length. The technique 1s based upon the observed approximate lognormality 
of rain attenuation and rain rate statistics within the range of interest; 
at reflects local meteorology through incorporation of the observed point 
rain rate distribution. Some rmportant parameters in the resulting formulas 
are determined empirically from experimental data. Sample calculated - 
results agree well with available experimental data from Georgia, New 
Jersey, and Massachusetts. This new technique may prove useful for 
engineering radio paths at frequencies above 10 GHz. Sample calculations 
of expected outage probability are given for 11- and 18-GHz radio links 
at Atlanta, Georgia, as a function of repeater spacing and transmission 
polarization. 


I. INTRODUCTION 


An important problem in designing radio relay systems at fre- 
quencies above 10 GHz is the radio outage caused by rain attenuation. 
Determination of the appropriate radio repeater spacings for economic 
and reliable operation requires a knowledge of the probability dis- 
tribution of rain attenuation as a function of repeater spacing at 
various geographic locations. Most available data on rain rate sta- 
tistics are measured by a single rain gauge at a given geographic loca- 
tion. A procedure for calculating a rain attenuation distribution from 
a point rain rate distribution is, therefore, needed. 

The results of rain gauge network measurements! ® indicate, how- 
ever, that the measured short-term distributions of point rain rate 
vary significantly from gauge to gauge. For example, at Holmdel, 
New Jersey, there was considerable variation! among the measured 
point rain rate distributions obtained from 96 rain gauges located in 
a grid with 1.3-km spacing over a 6-month period. Among these 96 
distributions, the incidence of 100-mm/h rains is higher by a factor of 
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5 for the upper quartile gauges than for the lowest quartile (see Fig. 
32, Ref. 1). Data from a rain gauge network in England® indicate 
that even with a 4-year time base and averaging over observations by 
four gauges with I-km gauge spacing, the four-gauge-average rain 
rate incidence can differ by a factor of 3 for rain rates above 80 mm/h, 
depending on which four gauges are chosen for averaging. This means 
that, on a short-term basis, the relationship between the path rain 
attenuation distribution and the rain rate distribution measured by a 
single rain gauge is not unique. The prediction of a path rain attenua- 
tion distribution from a point rain rate distribution is, therefore, 
meaningful only if the time base is sufficiently long to yield stable, 
representative statistics. Accordingly, knowledge of the long-term 
statistical behavior of point rain rate and path rain attenuation is 
essential for radio path design. 

The available experimental rain data (Appendix B, Figs. 10 to 13, 
and Ref. 6) indicate that the long-term distributions of point rain 
rate # and rain attenuation a are approximately lognormal within 
the range of interest to designers of radio paths using frequencies 
above 10 GHz. This paper describes a method for calculating rain 
attenuation distributions based upon this lognormal hypothesis. 

A lognormal distribution is uniquely determined by three param- 
eters (see Section 2.1). A set of equations are derived to relate the 
lognormal parameters of attenuation a to those of point rain rate R. 
Thus, given a long-term, representative distribution of point rain rate 
at a given geographic location, the rain attenuation distribution for 
any path length of interest can be calculated. The method is outlined 
as follows. 

The available theory’—® for converting rain rate RF into rain attenua- 
tion gradient 8 in dB/km is appropriate to spatially uniform rain 
rates, whereas actual rainfalls are usually not uniform over an entire 
radio path. To apply the uniform rain theory, the radio path volume 
is divided into small incremental volumes AV, in which the rain rate 
is approximately uniform. The rain rate R in each small volume AV is 
associated with a corresponding attenuation gradient 6 by the uniform 
rain theory. The total path attenuation @ is then the integral of 6 over 
the path volume. 

If the spatial distribution of the attenuation gradient @ were uniform, 
the path attenuation at a given probability level would increase 
linearly with path length. On the other hand, if the spatial distribution 
of 8s were not uniform and the time fluctuations of the 6s were sta- 
tistically independent, the incremental attenuation contributed by 
each AV would sum on an rms basis. Intuitively, we feel that the 
attenuation gradients at two different positions are partially corre- 
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lated, with a correlation coefficient that is a decreasing function of 
the spacing between the two positions. This behavior is described by 
introducing a spatial correlation function for 6. This spatial correlation 
function is used in the calculation of lognormal parameters of path 
attenuation a from those of attenuation gradient 8. 

In this formulation, the appropriate incremental sampling volume 
AV is of the order of 1 m* and the corresponding appropriate rain 
gauge integration time about 2 s, requiring, therefore, 2-s point rain 
rate distributions (see Section 2.3). 

Tables II and III and Figs. 5 to 8, discussed in Section IV, present 
comparisons of calculated and measured attenuation distributions. 
The satisfactory correspondence appears to validate the method of 
calculation. Section V and Fig. 9 present the calculated results for the 
outage probabilities of 11- and 18-GHz radio links in Georgia. Section 
VI discusses some qualifications to the methodology. 

Supporting material and mathematical derivations are given in 
Appendices A to D. Appendix E lists symbols and their definitions. 


ll. BASIC DEFINITIONS AND FORMULATION 
2.1 Lognormal distributions of attenuation and rain rate 


The equations approximating the rain attenuation distribution and 
point rain rate distribution are: 


Pla(L) = A|]=~ P)(L)-derfe née | (1) 
and 
] —InR,, 
P(R 2 r) > P,(0)-4erfe er a ; (2) 


where erfe (~) denotes the complementary error function, In (~) 
denotes natural logarithm, S, and Ser are the standard deviations of 
In a and In R, respectively, during the raining time, am and R» are the 
median values of a and R, respectively, during the raining time, Po(L) 
is the probability that rain will fall on the radio path of length L, and 
P,(O) is the probability that rain will fall at the point where the rain 
rate R is measured. The definition and the determination of raining 
time, and hence Po(Z) and P,(0), are discussed in Section 2.5. The 
measured distribution of point rain rate is a function of rain gauge 
integration time 7’.4-?° The appropriate integration time is about 2s 
in our formulation as discussed in Section 2.3. 


2.2 Radio path definition 


C. L. Ruthroff?! has defined a “radio path,” giving a physical as 
well as mathematical representation in the spatial volume significant 
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to propagation considerations (Fig. 1). In essence, the “radio path” 
corresponds to the first Fresnel zone, a prolate ellipsoid of revolution 
terminated at the ends by the transmitting and receiving antennas. 

Since the first Fresnel zone is circularly symmetric with respect to 
the path axis connecting the transmitting and receiving antennas, we 
adopt a cylindrical coordinate system (Fig. 1) coaxial with the path axis 
(z-axis) in the formulation. 

The radius h(z) and the circular cross section Q(z) (see Fig. 1) of the 
radio beam at a distance z from the transmitter are 


h(z) = — | (3) 
and 
Q(z) = rh?(z), (4) 


where J is the radio wavelength and L is the distance between trans- 
mitter and receiver. For example, at 18 GHz on a 5-km path, the 
average beam radius, the average beam cross section, and the radio 
path volume are about 3 m, 30 m?, and 150,000 m°, respectively. 


2.3 Path integral formulation for rain attenuation 


The spatial distribution of actual rainfall is usually nonuniform. 
The rain density, the point rain rate R, and the corresponding (point) 
rain attenuation gradient 8 are all functions of position (p, ¢, z) and of 
time (¢). The total rain attenuation a in dB incurred on a radio path of 
length L (Fig. 1) is calculated by integrating the incremental attenua- 
tion da along the = 


a(t) =f ‘Sa = Bo(z, tdz (5) 
= la mG : "f "Bp, b, % t)pdpdede, (6) 

where 
Bele) = ary ff. Bld %, Dpdpi 7) 


is the average value of B(p, ¢, z, ¢) over the radio beam cross section 
Q(z) at a distance z from the transmitter, and 


da(z,t) = Ba(2, t)dz (8) 


is the incremental attenuation experienced in the incremental segment 
dz at a distance z from the transmitter. 

To shorten the notations in the following equations, we use the 
vectors to denote the position (p, ¢, 2) and dv to denote pdpdd¢dz in the 
volume integration. 
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h(Z): RADIUS OF RADIO BEAM CROSS SECTION AT Z 
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Fig. 1—Configuration of a radio path. 


The rain density and the rain attenuation gradient 8 in dB/km are 
only meaningful with respect to a sampling volume AV large enough 
to contain sufficient raindrops to yield stable volume average quanti- 
ties. The results of measurements of rain density and rain rate by a 
photographic method” indicate that the typical rain density at 1-mm/h 
rain rate varies from 50 to 100 raindrops per m’, depending on geo- 
graphic location. This means, on the average, a 0.1-m* volume con- 
tains only 2 raindrops at 0.25-mm/h rain rate. Such a small sampling 
volume will not measure the rain rate in the conventional sense, but 
rather will “‘see’’ individual raindrops. Thus, for a meaningful mea- 
surement of rain rate below 1 mm/h, the sampling volume should be 
at least 1 m’. 

On the other hand, the available theoretical results’ relating 6 
and FR assume that the rain density is uniform within the volume 
of interest. To use the available functional relationship B(R) to 
convert the statistics of R into that of 8, the sampling volume AV 
must be sufficiently small so that the rain density (and the rain rate) 
is approximately uniform within AV. The observations by a capacitor 
flow rain gauge”?*4 and by a raindrop photographic method?® indicate 
that heavy rain has fine scale structure on the order of 1 m. This 
means AV should not be much larger than 1 m?. 

These two constraints indicate that the sampling volume should be 
on the order of 1 m*. We have chosen* 

AV 1m (9) 
in our formulation. 

“A choice of AV somewhat different from 1 m? is also possible. Since the spatial 
correlation coefficient of 8 depends on the sampling volume, the use of a slightly 
different AV would result in a slightly different characteristic distance G, defined in 


eq. (85) and determined in Section 4.1. For example, a larger sampling volume, with 
more smoothing effect, will result in a larger characteristic distance @ for B. 
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Therefore, in this paper, §(s, ¢) is defined as the rain attenuation 
gradient at time t, owing to rain in a 1-m*® sampling volume, AV, 
centered at s, and 6,(z, t) is the average value of 8(s, ¢) over the path 
cross section Q(z). 

If A, is the area of the collecting aperture of a rain gauge and Vp 
the average descent velocity of rainfall, then the appropriate rain 
gauge integration time 7',y to measure rain rate in a 1-m? sampling 
volume AY is 

AV 


A,: Vie to) 





Tay & 


For example, if A, = 0.073 m® (i.e., 12-in. diameter), then J'ay is 
about 2s, assuming Vr & 7 m/s. In the measurements of raindrop size 
distributions, Laws and Parsons?® have also used an integration time 
in the order of seconds during heavy rain. The Laws-and-Parsons rain- 
drop size distribution is the basis of most uniform rain theories for 
converting R into 6. We therefore define R(s, t) as the point rain rate 
measured by a rain gauge with integration time Tay, located at s. The 
shape of the 1-m* sampling volume defined by the rain gauge is cy- 
lindrical and is considerably different from that of the incremental 
AV. We assume that the long-term distributions of rain rates for these 
two different shapes of 1-m* sampling volume are approximately the 
same. 

Based upon these definitions, we postulate that the long-term prob- 
ability distribution of R(s, ¢) can be converted into the long-term 
probability distribution of G(s, ¢) by a relationship discussed in the 
next section. 

The integration times of most available point rain rate data are 
longer than the T,y (22 s) required by this formulation. The depen- 
dence of point rain rate distribution on the rain gauge integration time 
in the range 

15ss 7120s (11) 


has been determined by Bodtmann and Ruthroff™ for a 2-year (1971- 
1972) measurement at Holmdel, New Jersey. By using this experi- 
mental result and interpolation, we convert the available point rain 
rate distribution with T in the range (11) into a 2-s point rain rate 
distribution.’ 


2.4 Average relationship between rain rate and rain attenuation gradient 


The instantaneous relationship obtaining between the point rain 
rate R(s, t) and the corresponding rain attenuation gradient {(s, 2) 


* This relationship has not yet been demonstrated to be geographically independent. 
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Fig. 2—Theoretical relationship between rain rate and rain attenuation gradient. 


depends upon the particular distribution of raindrop sizes, shapes, and 
orientations, the speed and local direction of the wind, and the rain 
temperature. The average relationship, assuming uniform rain density, 
spherical raindrops, and Laws-and-Parsons drop-size distribution, has 
been calculated by Ryde and Ryde,!!-"™ Medhurst,® and Setzer.” 
Recently, Morrison, Cross, and Chu,’?? and Oguchi® have refined 
these calculations by including the effects of nonspherical raindrops. 
Figure 2 shows this theoretical relationship* for transmission frequen- 
cies of 11, 18, and 30 GHz. 

Many authors have pointed out that this average relationship be- 
tween the rain rate FR and rain attenuation gradient 6 can be approxi- 


mately described by 
B= yam, (12) 


where y(A) and 7(A) depend upon the radio wavelength » and the 
polarization of the radio signal. Table I lists the estimated y(A) and 
n(d) for 11, 18, 30, and 60 GHz. 


* In Fig. 2, the average of the absolute value of raindrop canting angle is assumed to 
be 25 degrees. This angle has been found representative by Chu (Refs. 28 and 29) in 
comparisons of calculated results with experimental observations (Refs. 30 and 31) of 
the differential rain attenuation experienced by horizontally and vertically polarized 
signals on the same radio path. 
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Table |— Parameters relating rain rate R and rain attenuation 
gradient pg 
B=y:Ra B in dB/km R in mm/hour 


Y 7 
Frequency 

GHz) 

V-Pol H-Pol V-Pol H-Pol 
11 0.013 0.015 1.22 1.23 
18.1 0.05 0.054 1.11 1.14 
30 0.15 0.17 1.04 1.04 
60* 0.7" 0.7* 0.814* 0.814" 


* The 60-GHz parameters are estimated from results in Ref. 10 in which only 
spherical raindrops are considered. 


Taking logarithms of both sides of eq. (12) yields 
InfB=Iny+7-InR. (13) 


From this equation, if the distribution of point rain rate R is approxi- 
mately lognormal in the range of interest, then the distribution of 
attenuation gradient 8 will also be approximately lognormal (see 
Appendix B). The distribution of 6 can therefore be written as: 


oe ee | 


W2-Sp ws 


P(g = B)~ P(0)-Serfe | 


where 6, is the median value of 8 during the raining time and Sg is 
the standard deviation of In @ during the raining time. Furthermore, 
eqs. (12) and (13) imply the relationships 

Se = 1'Sr (15) 
and 


Equations (15) and (16) allow us to convert the lognormal distribu- 
tion (2) of RF into the lognormal distribution (14) of 8, and vice versa. 


2.5 Rainfall probability P.(0) and raining time 
In principle, the probability of raining, Po(0), is obtained as the limit 


lim P(R = © = P,(0). (17) 


e—>0+ 
An instant ¢ is considered to be raining time if the condition 


lim R(t) > (18) 


e>O0+ 


is satisfied. The lower cutoff threshold in most presently available rain 
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RAIN GAUGE NETWORK DATA OF 
FLORIDA THUNDERSTORM PROJECT 
(REF.20) 


EMPIRICAL CURVE DESCRIBED 
BY EQ. (20) 


ORLANDO,FLORIDA 
2—MINUTE INTEGRATION TIME 


PATH RAINFALL PROBABILITY P,(L) IN PERCENT TIME 





0 5 10 15 20 
PATH LENGTH L IN KM 


Fig. 3—Rain-gauge network data on path length dependence of rainfall proba- 
bility Po(Z). 


rate data is about 0.25 mm/h. Therefore, in practice, we approximate 
0+ in definitions (17) and (18) by 0.25 mm/h. The rationale for this 
approximation is twofold. 


(7) Rain rates below 0.25 mm/h have practically no significant 
effects on radio communication links at frequencies below 60 
GHz. 

(71) Rain rates below 0.25 mm/h cannot be measured accurately 
by most existing rain gauges with standard recording strip 
charts. 


At the present time, the probability P(R = 0.25 mm/h|T = 1 min) 
is available at only a few locations.'*:!8.2° For most locations, we can 
obtain P(R = 0.25 mm/h) with 1-h integration time from the Weather 
Bureau hourly precipitation data.*® The experimental results on the 
effect of rain gauge integration time 7’ on P (0) in Florida” and 
Japan!’ indicate that 


P(R 2 0.25 mm/h| 7 <S 1 min) 
~0.5-P(R 2 0.25mm/h|T = 1h). (19) 


Therefore, we use Weather Bureau data and approximation (19) to 
estimate Po(0) at several locations of interest where direct measure- 
ment of Po(0) with 1-min integration is not available. 
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Intuitively, we expect the probability Po(L) of rainfall on a radio 
path of length LZ to increase with L, since a longer path has a higher 
chance of intercepting rain of limited extent. From the rain gauge 
network data (Fig. 3) of the Florida Thunderstorm Project,'4 we 
obtain the empirical formula 


1 — P,(0) 


Po(L) 1 — i pa (20) 
t+ ar | 
where LF is in units of kilometers and 
P9(0) = lim Po(L) (21) 


is the point rain probability that depends on geographic location. A 
theoretical consideration leading to the empirical form (20) is dis- 
cussed in Appendix C. 


Ili. OUTLINE FOR CALCULATING RAIN ATTENUATION DISTRIBUTION 


3.1 Path length dependence of median attenuation a,, and standard 
deviation S. 


From the lognormal approximations (1) and (14), it can be shown® 
that: 








s= In 1 + 3 (22) 
Sp= In fi + 2 (23) 
B 
Om = &-exp = | (24) 
Bm = B-exp | = ; (25) 
where 
& = Ey{a(t} (26) 
B = E,{B(d)} (27) 
oq = Hyioe(t)} — & (28) 
and 9 
op = E,{B(t)} — B. (29) 


E,{~} denotes a statistical (time) average under the condition that 
rain is falling on the radio path of length L, Eo{~} denotes statistical 
(time) average under the condition that rain is falling at the location 
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of interest. In eqs. (27) and (29), we assume that the long-term, large- 
sample, conditional statistical average 6 and variance of are inde- 
pendent of position s in or near the radio path of interest. Therefore, 
we omit the position s argument in these equations. 

By using eqs. (6), (22), (28), (24), and (25), we can derive formulas 
for the dependence of S,(L) and a,,(Z) on the radio path length L. The 
lengthy derivations are given in Appendix A. The results are 











S2(L) = In Po(L) f 4 H(L) ee -1]| (30) 
and 
Pee Bm Ls 5p eXP | = 5 =e | | (31) 
where 
vals, 8") = => (H.L8(8, )-6(6', 0] — Bu (33) 


is the spatial correlation coefficient**:** between 8(s, ¢) and 8(s’, t), and 
Ca(s, 8’) = ofu'Pu(s, 8’) (34) 


is the spatial covariance function**> of G(s, t) and G(s’, t), Euf{~} 
denotes the unconditional statistical (time) average including both 
raining time and nonraining time, 8, and o%, are the unconditional 
statistical mean and variance, respectively, of 8 as defined by eqs. 
(48) and (49) in Appendix A. 

In eq. (32), the integration volume is the entire radio path (Fig. 1) 
and is a function of both path length Z and wavelength \. However, 
the spatial correlation coefficient wy, of (point) rain attenuation 
gradient 8 defined in eq. (33) is not a function of radio path length L. 

If the random fluctuations of the $s were “coherent” along the 
entire radio path, then y,, H(L) and Po(L)/Po(0) would be identi- 
cally unity. Under such conditions, S, would be identical to Sg 
and am(L) would be equal to Bm-L as expected. The complexity 
of path length dependencies of S_ and am in eqs. (30) to (84) is caused 
by the partially correlated, random fluctuations of 8s at various points 
in the radio path. 

We postulate y, to have the functional dependence on distance 


_ GQ, av) 
vu = (ea, AV) +a eo) 
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within the range of interest, where 
d= |s—s’| 


is the distance between the two observation points (s) and (s’), and 
G(v, AV) is a characteristic distance at which y, = 1/v2. The de- 
pendence of 8 on wavelength \ (Fig. 2) and sampling volume AV 
(Section 2.3) indicates that the characteristic distance G may also be 
a function of \ and AV. However, in Table I, the exponents 7 are all 
very close to unity for frequencies ranging from 11 to 60 GHz. This 
means @ is approximately linearly proportional to R in this frequency 
range. Therefore, the characteristic distance G of 8 is approximately 
equal to that of point rain rate & and will not be very sensitive to fre- 
quency in the range from 11 to 60 GHz. 

Substituting (35) into (82) and carrying out integrations over p’ 
and ¢’ yield 


AnG [* bedz sh L Fdz’ 
H(L) = —>- é; — do{ —— 
ae, hl oe Se ee de QR)? ae 


where 
P= VW — VE + pcosa-n WY +h) — pcos? (37) 
ve — pcos é 
Wo= G+ (2 — 22 + p? + A2(2’) — 2ph(z’) cos 6 (38) 
c= @ + e+ @ 2) (39) 
G=¢o- ¢’. (40) 


The remaining integrations can be carried out numerically by com- 
puter. The calculated H(L) for G = 0.75, 1.5, and 3 km, respectively, 
are shown in Fig. 4. 

Notice that the radius h(z) of a ‘‘radio beam cross section” is on the 
order of several meters, whereas the characteristic distance G 1s on the 
order of kilometers (see Section 4.1). Therefore, 


G > h(z) (41) 


for most radio paths at frequencies above 10 GHz. Imposing the con- 
dition (41) reduces the complicated integrations in eq. (86) to the 
simple result 


2 2 2 
H(L) = 7 Wain[ B+ ai+ S| - i+ B+ (42) 


The differences in numerical values of H(L) calculated by (36) and 
by approximation (42) are less than 0.1 percent within the range of 
interest. Notice that H(L) is practically independent of wavelength 


1062 THE BELL SYSTEM TECHNICAL JOURNAL, JULY-AUGUST 1975 


0.8 


0.6 


H{L) 


0.4 


0.2 


5 10 15 20 25 30 35 40 
PATH LENGTH LIN KM 


Fig. 4—Dependence of H(L) on path length Z and characteristic distance G. 


\ because of condition (41). Therefore, we omit the wavelength speci- 
fication on Fig. 4. 

Thus, we have obtained all the necessary relationships. The pro- 
cedure for calculating rain attenuation distributions from point rain 
rate distributions is summarized in the next section. 


3.2 Procedure for calculating rain attenuation distribution 


(2) 


(22) 


(222) 


(zu) 
(v) 


(v2) 


Convert the measured distribution of point rain rate with 
T S 2 min into the distribution of 2-s point rain rates by the 
conversion factor in Ref. 15. 

If Po(0) with T <1 min is not available at the location of 
interest, use approximation (19) and the Weather Bureau 
hourly precipitation data to estimate Pp. 

Estimate the lognormal parameters R, and Ser of the 2-s 
point rain rate distribution by a least-squares approximation. 
This step is carried out by a computer iteration process to 
obtain the (R», Sr) pair that minimizes the differences (i.e., 
the sum of squares of errors) between the data points and the 
lognormal approximation. 

Calculate 6, and Sg by formulas (15) and (16). 

Calculate Po(L), am(L), and S,(L) by formulas (20), (80), and 
(31). 

Substitute Po(L), am(L), S.(L) into eq. (1) to give the attenua- 
tion distribution. 
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IV. COMPARISON OF CALCULATED RESULTS WITH EXPERIMENTAL DATA 


The measured rain attenuations in many experiments contain not 
only the path rain attenuation but also the transmission loss owing to 
wet radomes. The presently available information is insufficient for 
accurate estimation of wet radome attenuation as a function of rain 
rate, wind direction, radome shape, size, material, and surface aging 
effects. 

Based upon two measurements of wet radome attenuations dis- 
cussed in Appendix D, we assume that a flat, vertical radome causes 
1.5-dB attenuation during heavy rain. Therefore, 3-dB attenuation, 
caused by a pair of wet radomes, is added to the calculated path rain 
attenuation and the result is compared with the measured data 
utilizing such radomes. In some experiments, the flat radomes are 
slanted inward to further reduce wetting the radome surfaces. The 
attenuation caused by a pair of such radomes during rain is assumed* 
to be less than 3 dB. More detailed discussion of the radome problem 
is given in Appendix D. 


4.1 Determination of characteristic distance G 


From the 3-year (1971-1973) distribution of 1-min point rain rates 
measured at Merrimack Valley, Massachusetts,** the conversion factor 
in Ref. 15, and the Weather Bureau data, we obtain the following 
approximate lognormal parameters of the distribution of 2-s point rain 
rate: 


P (0) ~ 3.38%, (43) 
Rm & 1.23 mm/h, (44) 

and 
Sre& 1.34. (45) 


Following the procedure outlined in Section 3.2, we use these param- 
eters to calculate a family of rain attenuation distributions as a func- 
tion of the distance parameter G. Figure 5 displays the results for an 
18-GHz, 4.3-km path subject to Merrimack Valley rain and makes 
comparison with measured data (1971-1973) at the same location 
(Ref. 36 and Tables II and III). The radomes on this path are vertical 
and almost flat. The solid curves on Fig. 5 are calculated path rain 
attenuations plus assumed 3-dB radome attenuation. Figure 5 indi- 
cates that 

G 1.5 km (46) 


provides good agreement ; therefore, a 1.5-km characteristic distance** 


“The slanted radomes may get wet during some heavy rains accompanied by 
strong wind gusts. 
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Fig. 5—Determination of characteristic distance G by comparing experimental 
data with calculated results (solid lines) using rain rate data in Fig. 10. 


is used for the calculations and comparisons with other sets of data at 
other locations in the following sections. 

From eqs. (35) and (46), it is easily shown that d S$ 15 km for a 
spatial correlation coefficient, y, 2 0.1. In other words, a “rain cell,”’ 


based upon a definition of y, 2 0.1 within the cell, has a typical 
spatial extent of 15 km. Obviously, the cell size depends on its definition. 


4.2 Comparisons of calculated results with data in Georgia 


Figure 6 compares the calculated result with data from a 5.1-km 
17.7-GHz path at Palmetto, Georgia, measured during two 1-year 
periods (November 1970 through October 1971 and August 1973 
through July 1974).* The radomes on this path are flat and canted 
inward. The calculated result is based upon the rain rate distribution 
measured by a tipping-bucket rain gauge at Palmetto in the same time 


*The data from November 1971 to July 1973 are not used because of inter- 
mittent troubles in the rain gauge and the magnetic tape recorder. 
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Table Il — Experimental data on rain attenuation distribution 
Rain Rate 
Path . . Radome . 
Ref Freq Polari- {| Fig. ; Data Used in 
No. Authors No. | (GHz) Ee pation No. Path Location Time Base Pay be Phsoretical 
poe Calculation 
1 Barnett, 30,37 | 17.7 5.1 H 6 Rico-Palmetto, 11/70-10/71 | Flat, slanted No. 5 of 
Bergmann, Lin, Ga. 8/73-7/74 Table 2 
Pursley 
2 Pursley 37 11.6 42; V 7 Atlanta-Palmetto, | 8/73-7/74 | Flat, almost No. 6 of 
Ga. vertical Table 2 
3 Lentz, Kenny 36 18.4 4.3 V 5 Merrimack 1971-1973 | Flat, vertical No. 7 of 
Valley, Mass. Table 2 
4 Semplak 38 18.5 6.4 V 8 Holmdel, N.J. 1968-1969 | Flat, slanted*| No. Hs 5 
Table 


* These radomes are shrouded by wooden rain shields. 
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No. Authors 


1 | Ruthroff, Bodtmann 

Jones, Sims 

Jones, Sims 

Ruthroff, Bodtmann, 
Osborne 

Lin 

Lin 

Lentz 
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Norbury, White 
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Easterbrook, Turner 


Table I!I|— Experimental data on point rain rate distribution 


Ref 
No. 


36 


16 


17 


Location 


Miami, Fla 
Miami, Fla 
Urbana, Ill 
Atlanta, Ga. 
Palmetto, Ga. 
Palmetto, Ga. 
Merrimack Valley, 
Mass. 


Holmdel, N.J.* 
Slough, England 


Southern England 


Time Base 


1966-1970 
8/57-8/58 
5 /69-4/72 
1966-1970, 

1973 
11/70-10/71 
8/73-7 /74 
8/73-7/74 
1971-1973 
1968-1969 
1970-1971 


5/61-5/62 
1963 


Rain Gauge 
Integration 


Time 


25s 


10stolh 


2-60 min 


Rain 
Gauge 


Weighing 
Gauge 
Weighing 
Gauge 
Weighing 
Gauge 
Weighing 
Gauge 
Tipping 
Bucket 
Tipping 
Bucket 
Ruthroff’s 
(Ref 2) 


Special 
Dropper 
Gauge 

? 


Estimated Lognormal 


Parameters 
he Sn | Po(0) 
2.48 1.54 | 0.026 
2.48 1.54 | 0.026 
1.1 1.47 | 0.033 
3.23 1.15 | 0.026 
3.10 1.18 | 0.031 
3.85 1.11 | 0.030 
1.23 1.34 | 0.033 
1.53 1.38 | 0.026 
0.42 1.4 0.044 


* Point rain rate distribution from 1968 to 1969 at Holmdel, New Jersey, is estimated from rain attenuation data on short paths at 


Holmdel. See Section 4.3. 
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Fig. 6—Comparison of 17.7-GHz rain attenuation data at Palmetto, Georgia, with 
calculated result (solid line) using rain rate data in Fig. 12. 


period. Figure 6 has two calculated attenuation curves, one without 
radome attenuation and another with 3-dB radome attenuation. 

In the 11-GHz band, the path lengths of interest may be as long as 
50 km. It is desirable to test the validity of the method for long paths. 
A preliminary comparison is shown in Fig. 7 for an 11-GHz, .42-km 
path between Atlanta and Palmetto, Georgia. The radomes on this 
path are flat and almost vertical. The attenuation and rain rate were 
observed during a 1-year period (August 1973 through July 1974). 
The calculated-plus-3-dB-radome-loss result is reasonably close to the 
data. In Figs. 7 and 13, notice that the measured attenuation and 
rain rate distributions are both somewhat higher than the lognormal 
approximations in the probability range from 10~? to 4 X 10°? percent 
time. We believe that these deviations are an artifact of the short 
observation time. A more critical test of this method for long paths 
awaits longer-term data. 
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4.3 Comparison of calculated result with data in New Jersey 


Rain attenuation experiments on two paths (18 GHz, 6.4 km, and 
30.9 GHz, 1.9 km) were carried out simultaneously in 1968 and 1969 
at Holmdel, New Jersey.*® However, local point rain rate distributions 
were not measured during this period. On the other hand, Bodtmann 
and Ruthroff!® have suggested a method for relating point rain rate 
distribution to path rain attenuation distributions on short paths. 
Thus, by using the short-path attenuation data from the 30.9-GHz, 
1.9-km path and Bodtmann and Ruthroff’s method, the 2-year dis- 
tribution of point rain rate at Holmdel was estimated as shown in 
Fig. 11. Based on this estimated rain rate distribution, we calculate 
the 2-year rain attenuation distribution on an 18-GHz, 6.4-km path 
and compare this with the measured result in Fig. 8. The radomes in 
these experiments are flat, slanted inward, and shrouded by substantial 
wooden rain shields. We believe that the wet radome attenuation, with 
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11.6 GHz,42 KM,V—POL. 


PERCENT TIME ATTENUATION EXCEEDS ABSCISSA 


1 YEAR (8/73—7/74) 
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RADOME ATTENUATION 
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Fig. 7—Preliminary comparison of 1-year rain attenuation data from an 11-GHz, 


42-km path in Georgia with calculated results (solid line) using rain rate data in 
Fig. 13. 
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Fig. 8—Comparison of 18.5-GHz rain attenuation data in Holmdel, New Jersey, 
with calculated result (solid line) using rain rate data in Fig. 11. 


such protection, is negligible. Therefore, the calculated curve in Fig. 8 
contains only path rain attenuation. 

Figures 6, 7, and 8 show that the calculated results agree reasonably 
well with measured data. 


4.4 Excluded rain attenuation data 


Many sets of rain attenuation data in the literature are not included 
in the comparisons in Sections 4.2 and 4.3 because of one or more of 
the following reasons. 


(2) Many experiments used cone-shaped or hemispheric-shaped 
radomes. The transmission loss resulting from rain running on 
a pair of such radomes may vary from 0 to 14 dB depending 
on frequency, rain rate, and radome surface aging. The un- 
certainty in estimating such radome attenuation is too large 
for a meaningful comparison between the data and the cal- 
culated path rain attenuation distribution. 
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(21) The antennas in the experiment are not covered by any 
radomes, exposing the antenna feeds and reflecting surfaces 
to rain, snow, and ice. No information is available for esti- 
mating the possible transmission loss from the wetting of these 
elements. 

(412) The published information does not specify whether the 
antennas are covered by radomes or not. The configuration 
of the radome, if used, is also unknown. 

(zv) The polarization of the transmitted signal is unstated. 

(v) The time base of the experiment is too short to yield long-term 
representative statistics. 

(vz) No rain rate data with J < 2 min is available at or near the 
location of the rain attenuation experiment. 


V. OUTAGE ESTIMATION FOR 11- AND 18-GHz RADIO LINKS 


For a constant transmitter output power, the dependence of fade 
margin F')(L) in dB on the path length L is 


Fo(L) <r F'o(Lo) — 20 logio0 (=) dB, (47) 


where Lo 1s a reference repeater spacing and F'9(Lp) is the corresponding 
reference fade margin. For 11- and 18-GHz radio, reasonable clear-day 
reference fade margins are 


Fy) = 40 dB for 18 GHz at Ly = 4 km 
and 
Fy) = 40 dB for 11 GHz at Ly = 40 km. 


A radio outage occurs when the path rain attenuation plus the wet 
radome attenuation exceeds the clear-day fade margin Fo. By sub- 
stituting the fade margin (47) into the attenuation distribution (1), we 
can calculate the probability of radio outage per hop as a function of 
repeater spacing L. As an example, Fig. 9 shows the outage probabili- 
ties* for 11- and 18-GHz radio links in Atlanta, Georgia. 

The wet radome attenuation Ar is assumed to be 3 dB in these cal- 
culations of outage probabilities. 


Vi. SOME QUALIFICATIONS 


This section discusses some limitations, approximations, and assump- 
tions in the theoretical calculation procedure, the data employed, and 
the calculated results. 


* Multipath interference fading can also cause outages. An empirical formula for 
estimating the multipath-caused outage probability can be found in Ref. 41. 
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Fig. 9—Expected outage times of 11- and 18-GHz radio links as a function of hop 
length in Atlanta, Georgia. 


6.1 Uncertainty in estimation of lognormal parameters Po, Rm, and Sz 


Some point rain rate measurements report only the heavy rain (e.g., 
=30 mm/h) portion of the distribution, neglecting the light rain sta- 
tistics completely. Table III indicates that the median rain rates R» 
at many locations are less than 4 mm/h. In other words, the major 
portion (<98 percent) of the distribution is missing, and accurate 
estimation of the statistical parameters Rm and Sr from the tail 
region (<2 percent) is difficult. 

Furthermore, high rain rates (e.g., >140 mm/h) require a long 
observation time to yield representative, long-term statistics. The 
time bases of most available data may not be sufficient to yield stable 
statistics for these extreme rain rates. For example, at Newark, New 
Jersey, the l-min point rain rate exceeded 180 mm/h only once in the 
5-year period processed by Bodtmann and Ruthroff. To obtain rea- 
sonably stable statistics, we need a sample size much larger than 1. 
The omission of light-rain statistics together with the inherent in- 
stability of the extreme rain rate statistics causes considerable un- 
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certainty in the estimation of Po, Rm, and Sr. This uncertainty can 
be reduced significantly if the light rain portion of the distribution is 
also measured and reported. 


6.2 Path length dependence of P, 


The empirical formula (20) for the dependence of P» on path length 
Lis obtained from the rain-gauge network data in Florida. The test 
of the applicability of this empirical formula to other locations and 
the improvement of this approximation will require further multiple 
rain gauge experiments at other locations. 


6.3 Dependence of point rain rate distribution on rain gauge integration time 


The dependence of the point rain rate distribution on rain-gauge 
integration time 7’ has been obtained by Bodtmann and Ruthroff'® 
from a 2-year experiment at Holmdel, New Jersey. Since a 2-year time 
base may not be sufficient to yield stable statistics for high rain rates, 
a longer time base may be needed to improve this empirical conversion 
factor. The applicability of this result to other geographic locations 
also remains to be verified. 


6.4 Radome problem 


In the comparisons of calculated and measured rain attenuation 
distributions, the wet radome attenuations are assumed values. To 
improve this approximation, a more systematic experimental study on 
the dependence of radome attenuation on rain rate is needed. 


6.5 Anisotropic spatial correlation 


At some geographic locations, the squall lines of heavy rain may 
have a predominant orientation related to the predominant orienta- 
tion of weather fronts.*7-“ This means the spatial correlation Y may 
depend not only on the spacing but also on the orientation. However, 
the presently available information is not sufficient for a quantitative 
description of such an anisotropic correlation. Therefore, we use the 
isotropic correlation coefficient (35) throughout our theoretical cal- 
culations. Some of the difference between calculated and measured 
attenuation distributions may be caused by neglecting the anisotropy 
of the spatial correlation function. 


VII. CONCLUSION 


By using lognormal approximations, we have described a method 
for calculating rain attenuation distributions on microwave paths. 
The calculated results agree reasonably well with experimental data 
in Massachusetts, New Jersey, and Georgia. This procedure may 
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prove useful for the design of radio paths using frequencies above 10 
GHz. To demonstrate the application, Fig. 9 shows the calculated 
outage probability as a function of repeater spacing for 11- and 18-GHz 
radio links in Georgia. 
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APPENDIX A 
Derivation of Formulas Relating Rain Attenuation Distribution to Path Length 


Since the random fluctuations of the attenuation gradient @(s) at 
various positions in the radio path are partially correlated, we require 
the spatial covariance function of 8 to relate the variance of 8 to the 
variance of the path attenuation a. However, raining intervals at 
separated observation points are not always coincident. Hence, 
definition of the spatial covariance function for 6 requires a time base 
for all Bs common to all observations. A natural common time base 
fulfilling this requirement is the total time, including both raining and 
nonraining intervals. This means the unconditional* statistical means 
and variances of 6 and a@ are also needed in this formulation. We there- 
fore define 

B. = E,{B8()} (48) 
and ; 
Ogu = Ey {B?(t)} — Bu (49) 


as the unconditional mean and variance, respectively, of 8, where 
E,{~} denotes the (unconditional) statistical average including both 
raining and nonraining time. We assume that, on a long-term basis, 
8. and o%, are independent of position in or near the radio path of 


* Fading caused by other atmospheric effects, such as multipath interference and 
“earth-bulge,” is not treated in this paper. Therefore, the path rain attenuation a 
and the attenuation gradient 8 are taken to be identically zero during nonraining 
time. | 
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interest; therefore, we omit the position argument(s) in eqs. (48) and 
(49). Similarly, 
&, = H,ia(t)} (50) 
Sau = Ey,{a?(t)} — a (51) 


are the unconditional mean and variance, respectively, of a. 

Based upon the definitions (48) to (51) and the relationships (22) 
to (29), it can be shown that conditional and unconditional means 
and variances are related by 


B = B./Po(0) (52) 

& = &,/Po(L) (53) 

Sin |Po(0 }1 3 a || (54) 
and 

Sein | Pott) E ts a (55) 


Obtaining the unconditional statistical averages of both sides of eq. 
(6) yields 
G. = Bu L. (56) 


Substituting eqs. (6) and (56) into definition (51) yields 


= BLOG lon IL ae 


: ; : Oy yas at 2 
x fo [86S 0 -B(S dade! | — Bul 


~ im 1 Bs [haw 


x ff teL6(s, 0-81", 0] — Bu} dodo’. (67) 





Let us define a spatial covariance function** C’,(s, s’) for B(s, t) and 
B(s’, t) such that 


Ca(s, 8’) = E.{8(s, )-B(8', 0} — By (58) 
In other words, 
Piss es) (59) 
O Bu 


is the (spatial) correlation coefficient*4;* between G(s, t) and G(s’, t). 
Substituting definitions (58) and (59) into (57) yields 


ai a TBu oP Oke H(L), (60) 
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where 


Hay = 4 law boc [fom i [ous s')dvdv'. (61) 


Substituting (56) and (60) into (55) gives 





S2(L) = In P,(L)- E 4 H(L) 28 | (62) 
Combining (54) and (62) gives 
S2(L) = In Po(L): I + H(L) | ee a ||. (63) 
Combining eqs. (24), (25), (52), (53), and (56) gives 
ete Bm: DL: BrpeXP — | (64) 


This completes the derivation for S2(L) and a,,(L). 


APPENDIX B 
Lognormal Distribution of Point Rain Rate 


Figures 10 to 13 display the distributions of 2-s point rain rate 
observed in Miami, Florida; Urbana, Illinois; Atlanta and Palmetto, 
Georgia; Merrimack Valley, Massachusetts; Holmdel, New Jersey; 
and Southern England. The time bases range from 1 to 6 years. It can 
be seen that these distributions of 2-s point rain rate are very close to 
the lognormal approximation in the range below 100 mm/h. The rain 
rates beyond 100 mm/h are generally separated by more than 3 sigma 
from the median, and constitute the tail of the lognormal distribution. 
A very long observation time (e.g., more than 20 years) is necessary 
to obtain stable statistics of extreme rain rates beyond 100 mm/h.*—* 
Since the time bases of the data in Figs. 10 to 13 are much less than 
20 years, the deviations of the data from the lognormal distributions 
in the tails are not unexpected. 

The rain gauge integration time 7 in the original data range from 
1.5 s to 2 min, depending upon the source. As discussed in Section 2.3, 
the appropriate integration time 7’, corresponding to 1-m* sampling 
volume in our formulation, is about 2 s. From 2-year experimental 
data at Holmdel, New Jersey, Bodtmann and Ruthroff! have obtained 
an empirical relationship for the dependence of point rain rate dis- 
tributions on rain gauge integration ¢ time in the range 


15s< 7 s 120s. (65) 
This empirical result enables us to convert the original data into the 


2-s point rain rate distributions shown in Figs. 10 to 13. 
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Fig. 10—Lognormal distribution of 2-s point rain rate at Urbana, Illinois; Merri- 
mack Valley, Massachusetts; and Southern England. 


APPENDIX C 
Derivation for Path Length Dependence of P,(L) 


Let Po(L1) and Po(L; + AL) be the probabilities that rain falls on 
the radio path with length L,; and an extended path of length L; + AL, 
where AL is a small incremental length. The relation between Po(Z1) 
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Fig. 11—Lognormal distribution of 2-s point rain rate at Miami, Florida and 
Holmdel, New Jersey. 


and Po(L; + AL) can be written as 


where AP )(AL) is the incremental probability of rainfall associated 
with the incremental length AL. This incremental probability can be 
written as 


AP,(AL) = AP)(AL|no rain for 0 S$ L S Ly) 


< 
-P(norainforO = LS 1), (67) 
< 


where AP,(AL|no rain for 0 S$ L S I) is the (incremental) prob- 
ability that rain falls on the incremental length AL under the condi- 
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Fig. 12—Lognormal distribution of 2-s point rain rate at Palmetto and Atlanta, 
Georgia. 


tion that rain is not falling on the path L,. This condition 1s required 
because rainfall on LZ, and AL are partially correlated. 
We assume that 


AP,(AL|no rain forO0 S$ LD S Ly) « AL. (68) 
The justification for this assumption is 


(2) Po(Z) is expected to be a smooth, continuous function of J, 1.e., 


AL-0 
lim AP o(AL|no rain forO0 S$ L S Iy) = 0. (70) 
AL-0 
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RAIN GAUGE INTEGRATION TIME T = 2 SECONDS 


O 1 YEAR (8/73-—7/74) 
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PERCENT TIME RAIN RATE EXCEEDS ABSCISSA 
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Fig. 13—Lognormal distribution of 2-s point rain rate at Palmetto, Georgia. 


(22) The rain-gauge network data (Fig. 3) indicate that the slope of 
Po(L) is not zero, 1.e., 
AP 


AL ~% for L ¥ 0. 


Let b be the proportional parameter in assumption (68). Then 


AP,(AL|no rain for 0 S$ L S Ly) = b(L)-AL. (71) 


The unknown proportional parameter 6(Z) will be determined from 
rain-gauge network data. 
By definition, 


P(no rain forO S$ L S Ly) = 1 — Po(kh). (72) 
Combining eqs. (67), (71), and (72) yields 
AP (AL) = b(L)-AL-(1 — Po(hi) J, (73) 
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from which 
dP o(L)/dL 


T— P.(E) = OM). (74) 


Integrating (74) yields 
L 
P,(L) = 1 — cexp |- b(@ae| (75) 
0 


where c 1s an unknown constant to be determined by the condition 


lim Po(L) = Po(0). (76) 
Applying condition (76) to (75) gives 
P,(L) = 1 — [1 — P,(0) ] exp |- [beac (77) 


Since the rain-gauge network data yield Po(ZL) at only a few dis- 
crete distances, we need the quantized version 


AP o(Li)/AL 


—_ Po(L,) = b(L;), C= 1; 2, 3, rey Nn, (78) 


of eq. (74) for estimating b(L). By using eq. (78) and the rain-gauge 
network data in Fig. 3, we can calculate b(L;) at several discrete 
points. From these results, we find that b(L) can be approximately 
described by the empirical formula 


0.0282 


Nw = 
b(L) aS LL (km). (79) 
Substituting (79) into (77) and carrying out the integration yields 
1 — Po(0 
[+ a | 


which is the same as (20). 

Figure 3 shows that the empirical result (80) is reasonably close to 
all the data points measured by the Florida rain-gauge network. 
Admittedly, eqs. (79) and (80) are empirical. Further theoretical work 
and multiple rain-gauge experiments are needed to improve these 
approximations. 


APPENDIX D 
Transmission Loss Due to Wet Radomes 


A 20-GHz experiment by Anderson*® on a section of a 90-ft diam- 
eter radome, pertaining to an earth satellite radio link, indicated a 
transmission loss of 2 to 3 dB at 10 mm/h rain rate when the radome 
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was new. However, after 6 months of weathering, the transmission 
loss increased to 8 dB at 10 mm/h rain rate. A 4-GHz experiment*® 
on an earth-satellite radio link indicated a 3-dB transmission loss re- 
sulting from the wet radome. These experimental data agree rea- 
sonably well with theoretical calculations®!—* for rain rates $10 mm/h, 
assuming laminar water flow on hemispherical radome surface pointing 
towards the zenith. 

For typical 11- and 18-GHz terrestrial radio paths, the transmission 
losses from wet radomes are expected to be smaller than those of earth- 
satellite radio links because of the smaller radome size, different 
radome shape, and orientation. However, the terrestrial radio passes 
through a pair of radomes on each link; therefore, the contribution of 
wet radome loss to the total path attenuation may not be negligible. 
Theoretical calculation of wet radome attenuation pertaining to 
terrestrial radio links is not available at the present time because of 
the difficulty in calculating the nonuniform thickness of the water film. 
A semiquantitative experiment®® was carried out on the 12.2-GHz radio 
link between Murray Hill and Crawford Hill, New Jersey (22 miles). 
The 10-ft dish antenna was covered by a cone-shaped radome that 
was made of resin-coated fiberglass and, at 10 years of age, was well 
weathered. Water was sprayed on the radome-covered antenna by a 
manually controlled sprinkler. The results indicated that a uniform 
light sprinkle caused approximately 2.5-dB attenuation, whereas a 
very heavy spray (maximum stream of water) caused between 4- and 
7-dB attenuation. After the spray was turned off, 2 to 3 minutes 
elapsed before the signal recovered to within 1 dB of its nonfaded level. 
The residual wet radome attenuation is estimated to be 0.5 dB. 

On an 18-GHz, 4.3-km path at Merrimack Valley, Massachusetts, 
Ikenny*® has also observed a residual wet radome attenuation of 0.75 
dB (.e., 1.5 dB for two radomes). 


APPENDIX E 
List of Symbols and Their Definitions 


A, Collecting aperture of a rain gauge. 

Ar Attenuation by two wet radomes on a radio link. 

C's(s, 8’) Spatial covariance function of G(s, ¢) and 8(s’, t) as de- 
fined by eq. (58). 

dv = pdpdddz. 

E,{~} Conditional statistical (time) average under the condition 
that the point rain rates, in the radio path of length L 
(see Fig. 1), are not all zero. 
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V-Pol 


Conditional statistical (time) average under the condition 
that the point rain rate (defined in Section 2.3) is not 
zero at the position of interest. 

Unconditional statistical (time) average including both 
raining and nonraining time. 

Complementary error function. 

Fade margin of radio links. 

Characteristic distance defined in eq. (85). See also Sec- 
tion 4.1. 

Radius of the circular cross section of radio beam at a 
distance z from the transmitter. See Fig. 1 and eq. (8). 

Defined by eqs. (82) and (86). 

Horizontal polarization. 

The path length of a radio link; see Fig. 1. 

A reference repeater spacing defined by eq. (47). 

Natural logarithm. 

Probability that rain attenuation a exceeds A. 

Probability that rain rate R exceeds r. 

Probability that attenuation gradient 8 exceeds B. 

Probability that rain is falling on a radio link of length L. 

= limz,+oPo(L); the probability that rain is falling at the 
position of interest. 

= wh?(z); area of the circular cross section of radio beam 
at a distance z from the transmitter. See eqs. (3) and (4). 

Point rain rate measured by a 1-m* sampling volume 
located at s. 

Median value of the point rain rate R during raining time. 

A vector to denote the position (p, ¢, 2). 

Standard deviation of In a during raining time. 

Standard deviation of In R during raining time. 

Standard deviation of In 8 during raining time. 

Integration time of rain gauge. 

Time. 

Defined in eq. (10). 

Vertical polarization. 

Average falling velocity of raindrops. 

Incremental sampling volume for measurement of point 
rain rate. See Section 2.3. 

Distance from the radio transmitter (Fig. 1). 

Rain attenuation in decibels. 

Median value of a during raining time. 

Mean value of a during raining time. 
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Cua 


Cau 


OB 
OBu 


p 


ry 
Wu(s, §’) 
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A Geometric Derivation of Forney’s 
Upper Bound 


By J. E. MAZO 
(Manuscript received January 17, 1975) 


Effective analyses of performance for detection schemes that optimally 
decode digital data in the presence of intersymbol interference have been 
slow tn coming. Recently, however, Forney has given an upper bound on 
the bit error probability for maximum-likelithood sequence estimation. 
Starting from a standard geometrical framework, we give a much simplified 
derwation of this upper bound. Our derivation places the validity of this 
amportant bound more in evidence in that the concepts of whitened matched 
filter and error event are not introduced. 


Let a;,7 = 1, 2, ---, N, be independent, equilikely binary random 
variables taking values +1. Data transmission usually involves 
estimating the a; from a pulse sequence of the form 


N 
Dah(tt—jT), —%” <ti<o, (1) 
j=l 


which is observed in white gaussian noise of (two-sided) spectral 
density No/2. In (1), the minimum assumption put on the pulse wave- 
form h(t) is that it be Le. One possible detection procedure is to decide, 
on the basis of the received noisy signal, which one of the 2” equilikely 
signals given in (1) was ‘‘most likely’? (maximum-likelihood sequence 
detection) and use the sequence {a,} which is associated with that 
sequence as the detected symbols. As N grows large, the probability of 
deciding incorrectly on the sequence approaches unity; however, the 
real question revolves about the bit-error probability for maximum- 
likelihood sequence detection. Important work on this problem was 
done recently by Forney,! who showed that, under certain conditions 
on h(é), the bit-error probability for large signal-to-noise ratios goes 
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exponentially to zero as 
2 
P, & (coeff.) exp ( Trnin 


~ ANG 





) Ny 0, (2) 


where din is the minimum distance between the signal sequences in 
(1); i.e., if we add a superscript to distinguish among sequences thus 


N 
sO) = Vi aPh(tt —jT),-—e7 <ti< o 41=1,2,---, 2, (3) 
j=l 


then 
Pi Vine tain | 1s (#) — (4) |Pdb. (4) 


N-o ik 
ixk 
Forney’s demonstration consists of two steps. First, a lower bound on 
P. of the form (2) is established, valid for any h(t). Second, if H(w) 
denotes the Fourier transform of h(t) and 


= i | 1 (w) |2e**dw = O for integer k, |k| > v, v integer, (5) 
then an upper bound for P, can be given which is convergent for large 
signal-to-noise ratios and which, furthermore, also has an asymptotic 
form given by (2). ' 

In our opinion, Forney’s discussion of the upper bound is sufficiently 
complicated that some question remains as to how firmly the result is 
established. We shall give a much simpler derivation, but first let us 
review the situation when v = 0, 1.e., when there is no intersymbol 
interference. Using the reduction to the standard geometrical picture,’ 
the signal points (sequences) received in the absence of noise are as 
shown in Fig. 1. These signal points are to be regarded as being per- 
turbed by spherically symmetric, V-dimensional, zero mean gaussian 
noise; the variance of each component of the noise is No/2. Thus any 
point in the N-dimensional space may be received and the maximum- 
likelihood decoder chooses the unperturbed sequence nearest to the 
received point as the transmitted one. The decoding regions are shown 
in Fig. 2 by dashed lines, and are labeled R;,; in an obvious way. 

Now assume we transmit (1, 1) and ask for the probability that the 
first bit 1s in error. This is the same as the probability that the received 
signal point is in R_1,1 U R-1,-1, or equivalently that the received 
signal point is to the left of the line labeled S in Fig. 1. In N dimensions 
it would be the probability that the received signal point is on the 
opposite side of an (N — 1) dimensional hyperplane. This is clearly a 
simple one-dimensional gaussian problem having the well-known Q- 


1088 THE BELL SYSTEM TECHNICAL JOURNAL, JULY-AUGUST 1975 


Ri 4 


(=) 1) 8 (1,—1) 


R_4-4 





Fig. 1—Four signal points corresponding to sequences for N = 2 in the absence of 
intersymbol interference. 


B (1,1) 


(—1,—1) 





Fig. 2—Four signal points with intersymbol interference. 
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function for an answer, independent (because of the simple geometry) 
of the dimension or of which bit in the sequence is transmitted. 

When intersymbol interference is present, the error probability for 
the kth bit may well depend on k and N. In addition, the surface S 
which separates the sequences that have a, = +1 from those which 
have a, = —1 is no longer a hyperplane, although it is made up of 
segments which are hyperplanes. Finally, and perhaps a bit vaguely, 
the “shape” of the surface may depend on k. An example for N = 2 is 
given in Fig. 2 showing the separating surface for the first bit. 

Our goal is to derive Forney’s upper bound by geometrical arguments 
about as simple as those used in the discussion of Fig. 1. 

As in (3), we consider signal points identified by their respective 
data sequences {a,;}” and label them with a superscript. We focus 
on the kth bit being in error, and define sets A and B: 


A = {ala = +1}, B= {ala = —1} 
{b@}. (6) 


We are mainly interested in the chance that the maximum-likelihood 
decoder selects a point b € B, given that a particular sequence a) 
(say), a) € A, was transmitted. One upper bound on this is the union 
bound 


d(a™, b) 
P.(k) Ss me Q (“e’) , (7) 


where d(a, b) is the euclidean distance between a“ and b; Le., if / 
labels the particular b sequence, 


Pa, b) = fC Is (t) — s(@) |2dt 


-5 [ HeP 


vT 





N oar: 
sy (a? — bP )e| dw. (8) 
j= 


On writing (8), and henceforth, we set T = 1. Equation (7) is a bad 
bound because it includes too many terms on the right-hand side. This 
is easily seen by applying it to the N-dimensional hypercube (no inter- 
symbol interference), for which we obtain (ignoring unessential 
coefficients) 

Pak) S28 ere eNu, (9) 


where d? is the length of a hypercube edge. Thus, the bound, for any 
fixed No, approaches o as the length of the sequence N increases. 
Our next step will be to make some simple observations about the 
geometry of the received signal points [ when (5) is true ] that will allow 
us to delete most of the terms on the right-hand side of (7). Following 
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Forney, we are motivated to define another set B, of signal points 
which is a subset of B. A vector b € Bi if {b; — a$?}7_, (after deleting 
all zeros which begin the sequence and end the sequence of coefficients 
{b; — a$”}) does not contain v or more consecutive zeros either to the 
right of the kth position or to the left of the kth position. Forney’s 
upper bound then reads 


d(a, b) 
Pm) sy o( So). (10) 


To see why no further terms need be included in (10), select an arbi- 
trary signal point b* not included in the sum in (10); 1e., b* © B 
— B, = B*. From the way things have been defined, we may write 


a) = (a, 1, a1, a2, as) (11) 
b* (8%, aan G1, @2, 83), (12) 


where a; — 61 does not contain »y consecutive zeros in its coefficient 
sequence and a3 — 83 ~ 0. The 1 in (11) and the (—1) in (12) occur 
in the kth position, and a is at least of dimension », corresponding to 
the v (or more) positions where a! and b* are to agree. Now ar — Br 
may or may not contain »v consecutive zeros,’ and we distinguish these 
two cases in our discussion. First assume that ea, — 61 does not contain 
vy consecutive zeros. Then note that 


bY = (Bx, = 1s G1, Q@2; a3) S By (13) 
a®) = (az, 1, @1, @2; Bs) j= A. (14) 


Statement (13) is true by the absence of » consecutive zeros in BL — az 
and also in 61 — a1; (14) is true because af = +1. We may at this 
point imagine the four signal points a, b*, b®, a® in general position 
as in Fig. 3. Now focus attention on the triangle (a, a®, b*).4 We 
have (letting 0; denote a string of at least 7 zeros) 


#(a®, b*) = la — O*IP 
= lat ~~ BX, 2, ay — 61, 0,, a3 — B||? 
= |la& — BY, 2, a1 — Bill? + |las — Bsll?. (15) 


The last step in (15) follows from the Fourier integral expression (8) 
for the distance and from (5), the requirement that intersymbol inter- 
ference not extend beyond v. The right member of (15) can readily 
be seen from (11, 12) and (13, 14) to be d?(a™, b*) + @(a™, a). 


t Recall that a string of zeros in the beginning does not count. 
t We drop bold-face notation for vectors here, and also allow ourselves the freedom 
of writing the subscript L as a superscript. 
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bil) all) 


a(2) 


Fig. 3—The four signal points defined in the test illustrated in general position 
(not necessarily planar). 


Hence, 
Pa, b*) = (a, b*) + P(a™, a®) (16) 


and (a, a®@), b*) forms a right triangle. In an entirely similar manner, 
one verifies 
@(a®,b) = B(a™, b®) + &(a™, a®), (17) 


implying that (a, a®, b) is a right triangle. Since b® — b* = a 
— a”), we have 
ia, a®) = d(b™, b*) (18) 


and the additional fact that the four points lie in the same plane. 
Equations (16), (17), and (18), and planarity, imply that (a, a®, 
6%, 6*) form a rectangle as shown in Fig. 4. This demonstration assumes 
that there are v consecutive zeros to the right of the kth position and 
not to the left. If we interchange the words “right” and “‘left’’, the same 
type of demonstration will apply. There remains the case when there 
are v consecutive zeros both to the right and to the left of the kth 
position. In this case we write 


a) = (a3, a3, at, 1, a1, a2, as) (19) 
b* a (6, a’, BY, ~~ 1, Bi, 2, Bs), (20) 


where a3 — 83 #0 #4 ak — BY and neither a; — 61 nor a¥ — B4 con- 
tain y consecutive zeros in their coefficients. Further, we are to assume 
a, and a each have dimension at least v. If we define 


bY = (a3, a, BY, =1, Bi, a2, a3) & B, (21) 
a) (83, a3, of, 1, a1, ae, Bs) eA; (22) 
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the proof that (a, a®, b®, b*) is a rectangle can be carried out using 
the same techniques as earlier. 

Figure 4 makes it clear why, if a“ is transmitted, terms like b* do 
not have to be included in the right-hand side of (10). The term 


o() 


in (10) is the probability that, if a™ is transmitted, the received signal 
will be on the ‘‘wrong side”’ of the hyperplane H which perpendicularly 
bisects the line (b!a1). Now b* only needs to be included in (10) if its 
associated decoding region contributes some set of points of positive 
measure not accounted for in some other way. Thus, from Fig. 4, this 
is only the case if these new points were on the same side of H as 
a”, But any point on that side is closer to a® than b* and, hence, 
would never be decoded into b*. Hence the gaussian measure of the 
decoding region for b* is already included in the term 


If we calculate further upper bounds for (10) by letting N = o~, 
we obtain a bound independent of k. Averaging this over the possible 
transmitted symbols gives precisely Forney’s upper bound. The fact 
that the resulting upper bound converges for No small enough (for 





| 
| 
| 
H 


Fig. 4—The actual relationship of the four signal points defined in the text. 
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N = o) has been recently discussed by Foschini.* This last step is an 
important one in a full proof, and was overlooked in the initial work.” 
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We present the power spectrum of a sinusoidal carrier, frequency 
modulated by a random baseband pulse train in which the signaling-pulse 
duration is finite and the signal pulses may overlap and have different 
shapes. Symbols transmitted during different time slots are assumed to 
be statistically independent and identically distributed. The spectral 
density appears as a Hermitian form suitable for numerical computation 
by a digital computer. Simple conditions in terms of the modulation 
parameters are given under which discrete spectral lines are present in the 
spectrum. Several examples are given to illustrate the method. 


I. INTRODUCTION 


In recent years, digital-frequency and phase-modulation techniques 
have been increasingly important in radio, waveguide, and optical 
communication systems. 

An important parameter in the statistical description of a signal is 
its spectral density, which defines the average power density of the 
signal as a function of frequency. In addition to furnishing an estimate 
of bandwidth requirements, the knowledge of the spectral density is 
also essential in the evaluation of mutual interference between 
channels. 

In this paper, we extend the techniques developed in Ref. 1 for 
digital psx to the case of digital Fsk with phase-continuous transitions, 
such as may be obtained at the output of a voltage-controlled oscillator 
driven by a digital baseband wave. We assume that the sinusoidal 
carrier is frequency modulated by a random, baseband pulse train in 
which the signaling pulse duration is finite and the signal pulses may 
overlap and have different shapes. It is generally assumed that the 
symbols transmitted during different time slots are statistically inde- 
pendent and identically distributed. 

We express the spectral density of such an ensemble of continuous- 
phase, constant-envelope, digital, Fm waves as a compact Hermitian 
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form that provides an appropriate division between analysis and 
machine computation. The present work permits simpler numerical 
computation of digital rsk spectra than earlier studies,?—° and contains 
a simpler statement of the conditions, in terms of the modulation 
parameters, that determine whether discrete spectral lines are present 
in the spectrum or not. 

Examples give the spectra of binary and quaternary FSK waves with 
overlapping baseband modulation pulses of several shapes. 


Il. M-ARY FREQUENCY-MODULATED SIGNALS 
We seek the spectrum of the digital frequency-modulated wave: 


a(t) = cos[2rft+¢@)], f. > 0. (1) 
6) = f falu)de. (2) 
fat) = Pe h(t -kT), s =1,2,-°:,M. (3) 


The symbol ¢ is the phase and fa the frequency deviation of the carrier at 
frequency f,. The signaling alphabet consists of M waveforms hi, 
ho, +++, hu, that may have different shapes; one of these 1s transmitted 
for each signaling interval of duration 7. The different signaling 
waveforms in (3) may overlap, but are statistically independent in 
most of the present work; i.e., s; is statistically independent of s; for 
kA l. 
Define for convenience 


OU) Serer (4) 
then 
a(t) = Re {e?*Fety(t)}. (5) 
The spectral density of v(t) is 
Pf) = [ &e(r)e*¥edr, (6) 
where 
Sooper 1 A 
&,(r) = 6 (i, 7) = lim = J &,(t, r)dt, (78 
Ave 2A —A 


D505) OM ae a) (tee ey) 


= Cexp (5 f falu)du ))- (8) 


denotes average on é throughout. 





§ The symbol 
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An arbitrary constant of integration is implicit in the notation of 
(2), which defines the phase. This is of little consequence in the re- 
mainder of the present paper, since most of our study is directed to- 
ward the spectrum of the complex wave v(t); as seen by the final line 
of (8), the absolute phase is irrelevant in determining P,(f). The 
absolute phase must be rendered explicit only for the following three 
purposes in the present work: 


(2) Relating P,(f) to P,(f) (the spectra of the real FSk wave x(t) 
of (1) and of the complex Fsk wave of (4), respectively). 
(22) Separating possible line-frequency components from the Fsk 
wave. 
(272) Specializing the present Fsk treatment to the prior PSK results.! 


To make the absolute phase explicit, we write 
t 
60 = [ fawddu + 60); v(0) = 0, (9) 


The term ¢(0) is the phase of the FSK wave at t = 0. We consider three 
representative assumptions for ¢(0): 


(2) @(0) deterministic, e.g.: 
¢(0) = 0. (10) 


(22) @(0) random, uniform, and independent of the modulation s;: 
Pr [¢ < ¢(0) <otdg}=S, (2929 4H 


(212) #(0) dependent only on the modulation parameters (or signal- 
ing pulses) that contribute to fz(0+)§ in (3): 


—kT 
oO = Ef ha lade. (12)8 

kthe,(—kT'+) #0 J—o 
The signal-pulse duration is assumed finite; consequently, in 
(12), the >> has a finite number of terms, and the lower limit 
on the £ becomes finite. Specific examples of (12) appear in 
Section IV below. In other words, the net phase contributed by 
all past signaling pulses that are over by ¢ = 0, 1.e., for which 
hs,(—kT+) = 0,3 is normalized to 0. 


’ The -+’s indicate that fa and h,, are to be evaluated an infinitesimal time later 
than 0 and —kT, respectively, if any of the signal pulses have discontinuities at their 
upper limits at a time-slot boundary (see Section IV). If the signal pulses and, hence, 
ri see pa frequency deviation fa(t) are continuous, these two +’s may be 

ropped. 
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In the appendix we show that, except for special modulations with 
low carrier frequencies f, that take on special values, the following 
simple relation gives the spectrum of the real wave x(t) of (1): 


PAD aod Je) ae Pod Je). (13)8 


The first term of (13) is the spectrum of the complex baseband wave 
v(t) shifted to the carrier frequency +f,; the second term is the spec- 
trum of v*(t) shifted to —f,. More specifically, (13) is valid if any of 
the following are true: 


(1) Eq. (11) holds, independently of any other considerations. 

(iz) Eq. (10) or (12) holds, and P,(f) [and hence P,(f) ] has no 
line components. 

(222) Eq. (10) or (12) holds, P,(f) [and hence P,(f) | has (equally 
spaced) line components, and f, is low enough so that the two 
terms of (13) overlap, but f, is such that the line components 
of the two terms of (13) do not coincide. The discrete values 
forbidden to f, under these conditions are given by (146) in 
the appendix. 

(2v) The carrier frequency is so large that the two terms of (13) do 
not significantly overlap, independently of any other 
considerations. 


Consequently, we study only P,(f) throughout the remainder of this 
paper. This suffices for all cases except that of a low carrier frequency 
f. that takes on special discrete values related to the baud rate 1/T 
and the modulation, for very special modulations that result in line- 
spectral components (PSK is one such case, but there are others). 

The condition expressed by (12) is assumed in much of what follows. 
This results in no significant loss in generality in treating the spectrum 
of the real wave x(t), as discussed above. It permits economy of nota- 
tion in the general case, a convenient treatment of line components 
present in special cases, and simple specialization of the present FSK 
results to prior PSK results.! 


Hl. NOTATION AND STATISTICAL MODEL 


We introduce the vector-matrix notation of the prior Psk study.’ 
Since that study allowed correlated modulation parameters s; while the 
present work is largely restricted to independent s,, only a portion of 
Section IIT of Ref. 1 need be summarized here. 


§ Compare (5) of Ref. 1. 
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We write (3) as 


fal) = Sanh — kT), (14)8 

where 
an = acl’ = (af? aP --- af], (15) 
h(é) = h(t) ]! = LAi(t) halt) «++ har (t) J. (16) 


For a given k (1.e., for a given time slot) one of the a;’s is unity and the 
rest are zero: 
a = 1; aff =0, 1+ &. (17) 


Thus, a; is a unit basis vector, 1.e., a, has one component unity and all 
other components zero. 

The modulation process s; is assumed stationary, as in the prior 
PSK study,! but here we make the stronger assumption of independence 
in most of what follows. Define the first-order probability 


Ww; = Pr { Si = 1} (18) 


as the probability that the 7th signaling waveform is transmitted in 
the kth time slot. w; is independent of k by stationarity. By 
independence, 


Pr {s, = 7,81 = j} = ww; k #1. (19) 
Then 
wi= Prjia,=[0 0 :-:- 0 1 0 --+ O J}- (20) 
Lo 2 os. 1 4 SFL + mM 


Normalization of the total probability requires 


2 wi=l. | (21) 


i=l 
We use the following vector notation for the probabilities : 


w= wl) =[ws we--- wa]. (22) 


hme 


§ The following notational conventions are adopted throughout: 


(t) Boldface quantities denote matrices. 
(77) Row and column vectors are distinguished by the additional notation . and 
], respectively. 

(222) Ordinary matrix multiplication is indicated by -, Kronecker matrix products 
by X (see second footnote, page 908, Ref. 1, for properties of Kronecker 
products used throughout the present paper). 

(7v) The transpose of a matrix is indicated by ’ . 

(v) The Hermitian transpose of a matrix is indicated by f. 

(vt) Multiple Kronecker products are indicated by IIx (see footnote, page 911, 
Ref. 1) and the Kronecker power is indicated by an integer exponent enclosed 
in square brackets. 
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Further define a vector (here of dimension /) with all elements unity 
as | 


l=1y2(1 1-:- 1]. (23) 
Then (21) may be written 
1-w] = w;1] = 1. (24) 
Finally, for convenience later, define the diagonal matrix 
W1 0 
Wi = oe (25) 
0 WM 
Then, from (22) and 23), 
wal] =w], 1-we =, (26) 
Note that . 
(ax) = Wy (27) 
(ax ]-ax) = wae (28) 


IV. FSK AS A BASEBAND PULSE TRAIN 
We seek an expression of v(t), given by (4), (9), (12), and (14), of 
the form 


»(t) = LD ovr — kT)] (29) 


for signaling pulses of finite duration. Assume the h;(é) of (8) are 
strictly time-limited to an interval KT, as follows : 





Ue = oO, ‘ts oe t > Ur (30)§ 
eee sods: 
Lk = 2 
LK K ze _ 
— T | ie even. , 
tee (31) 
‘Kt T.-. Koda. 
Ux = K 
5 T, KK even. 


Lx and Ux are respectively the lower and upper limits of the pulses. 
Figure 1 shows portions of fa(t) for four different maximum signal- 
pulse durations; the terms k = —1, 0, 1, 2 of (14) are shown, and for 


§.0,= 0]’ is a vector of appropriate dimension (here M) with all elements zero. 
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Fig. 1—Frequency modulation for different signal pulse durations. Index k is 
shown near peak of each pulse. Also, for simplicity, same signal pulse is shown for 
each k. T = time slot duration or signaling period. KT = maximum signal-pulse 
duration. Note different pulse center location for odd and even K. 


convenience a; has been taken the same for each of these time slots. 
The pulses are positioned along the time slots such that the limits of 
each signal pulse lie on the boundary between adjacent time slots (i.e., 
¢t = integer: 7); this results in different definitions for Lx, Ux for K 
even and odd. Since symmetric pulses have been chosen for illustration 
in Fig. 1, their maxima are centered in the time slots for AK odd, and 
lhe on the time-slot boundaries for K even. Discontinuities are permitted 
at the pulse edges (and elsewhere), but are not present in the example 
of Fig. 1 (and would not normally be present in a mathematical model 
of a physical system); discontinuities at the time-slot boundaries are 
restricted by the inequalities at the right of (80). Examine the (0, T'] 
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time slot in Fig. 1 as typical; then the number of pulses contributing 
to fa(t) at every instant equals K.$ 

It remains for us to express the pulse shapes r(¢) and coefficients c;, 
of (29) in terms of the signal pulses h(t) and coefficients a; of (14). We 
give separate treatments for the cases K = 1 and K = 2, and extend 
these results to general K. The treatment is an extension of that for 
the PsK case, given in Section IV of Reference 1. 


4.1 Nonoverlapping pulses: K =1 


The top portion of Fig. 1 shows digital frequency modulation for 
which the signal pulses in different time slots never overlap; in this 
case from oy to- (31), 


h(it)]=0], «50, t>T. (32) 
Define 


| exp (; [tatu ) exp (7 fy hetuau 
sip (; [/taddu) |, 0<t<T. 


a(t) = (33) 
0, t<0, t>T. 
Equations (4), (9), (12), (14), and (33) yield 
#(0) = 0; v@)=L. ~ BA) 
o() =D Siar-a(t — kT)I (35) 
where 


| 
II acsaq*(T)], k <0. 


t=k+1 


S, = 1 k=0 | (36) 


aiq(T ) |, k > 0. 


a 


Comparing (35) and (29), the parameters of the latter are given as 
follows for nonoverlapping signal pulses: 


= Sibi, De = ax; 


ri) ] = a@1J; 
S; in (37) is, of course, given by (36). 


K = 1. | (37) 


§ The same conventions were used in Fig. 1, Ref. 1, for the baseband modulation 
pulses in digital psx. 
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4.2 Overlapping pulses: K = 2 
This case is illustrated in the second portion of Fig. 1. In the (0, 7 ] 
time slot, the k = 0, 1 pulses contribute. We have from (30) to (31) 


h?t)]=0], «= —-T, t>T. (38) 
Define 


Jexp (5) ha(uau exp (5 [howe ) 
exp (jf lade) |> —7 <es 7. 
qt) = (39)8 
0, feet. toe. 


Lt 


Equations (4), (9), (12), (14), and (39) yield 
6(0) = fo ha lwddu, (40) 
v() = SY Srlan-ad — kL) I} aerate — + YT)}, (1) 


where S; remains as given in (36), the same as for the prior K = 1 case. 
Proceeding exactly as in (51) of Ref. 1, (41) above yields (29) with the 
following parameters, when no more than two signal pulses overlap: 


Ci, = SxDiy Da = ax, X aesy; 


ri) ]=q®jxaqt— T)]; 


S;, 1s given by (86), and X denotes the Kronecker product [see foot- 
note to (14) ]. The term by, like a;, is a unit basis vector, i.e., it has one 
element unity and the remaining M? — 1 elements zero.’ Note from 
(42) and (39) that 

r7é)]=0], ¢S$0, t>T. (43) 


Binary FSK offers a simple example of these results, governed by the 
same relations between a, and b; as for binary Psk given in (57) of 
Ref. 1. 


K = 2. (42) 


4.3 Overlapping pulses: general K 


The general case follows by straightforward extension of the above; 
see (58) to (62) of Ref. 1. Figure 1 illustrates the frequency modulation 
for K = 3, 4. The modulation pulse restrictions are given by (30) and 


§ Comparing (39) with (33), note that the definition of q(t) is different for different 
kK; q(t) ¥ 0 over the same interval in which h(t) may be nonzero. 
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(31). Define 


exp (7 [. In(u)du.) exp (; [. halu)dy 


begs (; i ha(u)du) | ie Lie Ux. 
a(t) = : (44) 


aioe 0, pee. Pee 
(0) of (12) is 
(K —3)/2 —kT 
J he,(u)du, Kodd, K > 1. 
km —(K—-1)/2 J bg 
¢(0) = (45) 
(K — 2) /2 —kT 
i h,,(u)du, Keven, K>0. 
k= —(K—-2)/2 Jig 


The parameters of (29) are 
(K—~1)/2 


x aii, K odd. 
i=—(K—1) [2°77 
©,= Sibi; by = (46) 
K/2 
aii, KH even. 
x, ait 
(K—1)/2 
IIx a@é—7T)], K odd. 
j=—(K—1)/2 
r(t)] = (47) 
K/2 
IIx a(é—7T)], K even. 
1=— (K--2) /2 


II x denotes a multiple Kronecker product [see footnote to (14) ]. S; 
is given by 





0 
IT avcepyyeq*(Ux) J, & <0; 
sae kbl 


S, = 1, k=0; K odd. (48) 


k 
IT av cxzns2-q(Ux) J, k > 0. 
i=l 





0 


aicxj2)'Q*(Ux)], k <0; 
Sa 


w=k+1 


S, = i, k=0; K even. (49) 


k 
II aecxjey-q(U x) |, k > 0. 
i 
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Note that 
r7i)J=0], ¢s0, t>T. | (50) 


4.4 Discussion 


It is instructive to obtain the prior psk results! by specializing the 
present FSK results. Define the phase shift produced by each signaling 
pulse as 


t 
gi) = fo helu)du, 1 = 1,2, +++, M (51) 
K 


or in vector notation 


gO1= f bW)Ien, (52) 


and substitute into (44)* (or into (33) or (39) for special cases K = 1, 
2). Then the present results of (29), (31), and Section 4.3 are similar to 
the former PSK results of (48) and Section 4.3 of Ref. 1, except for the 
factor S;,; in particular, the equations for q(t), b;, and r(¢) have an 
identical form. The additional factor S, present in the FSK case ac- 
counts for the total phase shift introduced by each of the signaling 
pulses. 

To specialize the present FSK results to the PSK case, we require the 
total area of each of the present modulation pulses to be zero. Thus, 
(51) and (52) become 


U 
gi(Ux) = i ” Aitdt = 0, 2= 1,2, +++, (53) 88 
Lx 
or 
UK 
g(Us)]= f° ha = 04, (54)¥* 
K 
Substituting in (44), 
q(Ux)] = 1]. (55) 
Therefore, (17) yields 
as-q(Ux)] = 1. (56) 
Substituting in (48) to (49), 
Sk = l, all k, K, (57) 


completing the specialization of the present FsK results to the psk 
case.} 

In the general Fsk case, the results of Sections 4.1 to 4.3 above reduce 
the rsK problem to determining the spectrum of (29). This is ac- 


§ Only the range Li < t S U; is relevant in (51), (52), since q(t) = 0 outside this 
range. 

§§ Equations (30) to (31) and (53), (54) render (51) and (52) zero for ¢ S Lx, 
t = Uk, satisfying (58) of Ref. 1. 
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complished by deleting (149), (153), and (163) to (171) of Ref. 1, 
Appendix B.’ Then the spectral density of (29) above is given by setting 
b — cin (150) to (152) and (160) to (162) of Ref. 1 as follows: 


PS) = FRW-PAGT)-R*)]. (58) 
P.(f) = x erin, (n) | (59)8 
®.(n) = (Cisn]-Cx) (60) 
RN] =RO = [Peo dat, (61) 


In these relations R(f) is the Fourier transform of r(é) of (37), (42), or 
(47), depending on K (i.e., the amount of pulse overlap). ®c(n) is de- 
termined from (36) and (87), (86) and (42), or (46) and (48) or (49) 
in Section VI for K = 1, 2, and general K, respectively. 


V. FSK WITH LINE COMPONENTS 
From (21), (22), and (44), 


weq(Ux) J] $1, (62888 


We show in the present section that equality in (62) corresponds to 
the presence of line components in the Fsk spectrum. Conversely, if 
the inequality of (62) is satisfied, we see in Section VI that the rsx 
spectrum contains no line components. 

Assume throughout the remainder of the present section that 


\w-q(Ux)]| = 1. (63) 
This yields 
UK 
i h,(t)dt = 2rf; + integer-27, 71 = 1,2,---, M, (64) 
LK 


where 27f; is defined as the common area (mod 27) of all of the Fsx 
signaling pulses in the line case. For definiteness we take 


—7< fiz. (65) 


Equivalently, 
q(Ux)] = e®ft1], —3< fi S 3. (66) 


§ These deleted portions were relevant to the study of line spectral components of 
psk.! The line spectral components of Fsk are treated separately in Section V. 

$§ See footnote, page 905, Ref. 1. 

§§§ This follows because |q:(t)| = 1. 
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Now substitute (66) into (48) and (49); we have 
S, = efi allk, K. (67) 
Thus, when (62) is satisfied 


v(t) = = eS be -r(t — kT) I. (68) 


The relation (68) is the same as the psk result of (43) of Ref. 1 
except for the factor es; as noted following (52), b, and r(t) have 
the same form in FsK and psk. Consequently, when (63) is satisfied, the 
FSK spectrum may be obtained by simple transformation of the psk 
results of Ref. 1. One way this may be done is to rewrite (68) as 


v(the2r sur yt = 2 bi: (e~2* FUT) “kT Ig (f — kT) )). (69) 
Comparing with (48) of Ref. 1, all the psx results of Ref. 1 apply 
directly to the present case by making the substitutions 


v(t) =} v(t)e 227 fui Tt, 
r(t)] > eB UUTDt E(t), 


Therefore, when (68) is satisfied, and consequently (68) holds, the 
FSK wave has line spectra. We separate the line and continuous com- 
ponents as 


(70) 


v(t) = vit) + v(t). (71) 
The line component is given from (70), and (114)$ of Ref. 1, as 


vi(t) puis. > R (“4 +h )| ei2a (nts / Tt 


n=—- 7 


1 , n+ fi rs 
Pal) = pa lWeE RI? D3 (t-"F“), 


The spectral density of the continuous component is found from (70), 
and (69),5§§ (96), or (116) of Ref. 1 as follows: 


PAU = aR PRN) (73) 


§ Or from (66) and (95) of Ref. 1 for the special cases k = 1, 2. 

§§ See footnote to (14); an exponent enclosed in square brackets denotes the 
Kronecker power, 1.e., the Kronecker product of a matrix (or vector) with itself the 
indicated number of times. 

§3§ This is equation (69) of Ref. 1 specialized to independent signal pulses, i.e., 
with (32) used for all n # 0, yielding (72) or (73), of Ref. 1. If this restriction is not 
imposed, and (70) above is used in the general form of (69) of Ref. 1, we obtain the 
spectrum of digital rsk with line spectra, with nonoverlapping signal pulses having 
arbitrary correlation (rather than being independent, as in the remainder of the 
present paper). This is the only correlated case that can be readily treated by the 
present methods. 
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{| }=Wae-—wlw, K=1. (74) 


{ } = wh — w?l.wil 
+ (w X Wa X w | ae w |! wll) e-2a FT—f1) 
+ (w | X We XW, mae, w fl?! WE)eterGT—-fi) K = 2. (75)8 


expression in { } in (116) of 


| } = pet 1 with ff fT — fy generat ts: (76) 


The condition of (63), that has here been shown sufficient, is shown 
in Section VI also to be necessary for line components to be present 
in FSK spectra. This condition has a simple physical interpretation. 
Every signal pulse must introduce the same total phase change in the 
modulated carrier in order to have line components. This phase change 
has been denoted as 27f,; the line components appear at frequencies 


#(f.+25*): n= ---,—1,0,1,---. (77) 

When f/f: = 0, (73) to (76) show that the Fsk spectra are identical to 
the prior PSK results.! However the wave in this case is not necessarily 
a PSK wave. The stronger condition of (53) or (54) 1s required to have 
a PSK wave; this condition demands the net phase shift introduced by 
every signaling pulse to be zero. A wave with f; = 0, but one or more 
signal pulses with net phase change equal, for example, to +27, will 
have a spectrum given by the psx formula, but will not be a psk wave. 


Vi. FSK WITH NO LINE COMPONENTS 


Assume throughout this section that the inequality of (62) is 
satisfied : 
\wa(Ux)]| <1. (78) 


We demonstrate that under these conditions the Fsk spectrum contains 
no lines. The properties of Kronecker products given in Ref. 1 in the 
second footnote, p. 908, and in the first footnote, p. 915, are used 
throughout without further comment. 


6.1 FSK spectrum with no line components: nonoverlapping pulses, K = 1 


From (37) and (60), 


B.(n) = ((Sr+nSz) (@r+n]-ax)). (79) 
From (36), 
SipnSi = > ani a(T)], n> 0. (80) 


§ See footnote to eq. (72). 


1108 THE BELL SYSTEM TECHNICAL JOURNAL, JULY-AUGUST 1975 


Let us first consider ®,(0). Since S,S; = |S;,|2 = 1, for all k, we 
have from (79) and (28) 
®.(0) = Wa. (81) 


Next, (79) and (80) yield 


Gn) = | Tacssa(2)I} (a2) laeeed-ay}, mB 1, 82) 


where we have split off the first factor of (80). For n = 1, the first 
{ }, containing the [[, is dropped. The first { } is independent of the 
remainder of the expression by (19); using the fact that q(7’)-ax] is a 
complex number or equivalently a 1 X 1 complex matrix, and using 
(27) and (28), 


®.(n) = {w-q(T)]}" aren] q(T), ax ]-ax) 


iw-a(7’) Jj" {w]-q(7)-wa}, n 2 1, (83) 
the last step following from the independence of a;., and ax. 
Finally, taking the Hermitian transpose’ of (79), 


E(n) = ((St+wS1) (@x]-Anen)). (84) 
Alternatively, setting n — —vn in (79), 
®, ( —n) = ((SSp—n) (Bey) “ax) ys (85) 


Since (79) has been shown independent of k, we may set kk +n 
in (85), to yield upon comparison with (84) 


®.(—n) = @}(n). (86) 
From (58) to (61), (81), (83), and (86), 
P.(jT) =A+ At, (87) 


A= hwy + Pvtw)-g(P)-we D (ee? weq(T)]}. (88) 


Because of (78) the geometric series in (88) converges, and 


e~Pas? w ]-q(T)- Wa 


1 
Ao= 9 Wa 3 is e-PT w-q(T)] (89) 
Finally, the spectrum of the complex FSK wave of (4) is 
1 
PF) = PRO), (A + A) RES) I, (90) 


with A given by (89) for nonoverlapping pulses, K = 1. 


§ See footnote to (14). 
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It is clear that the restriction of (78) renders A and, hence, P,(f) 
finite for all f; consequently, there can be no line components in the 
FSK spectrum. If we take the limit as |w,;q(7) ]| — 1, and substitute 
(66) into (88), (90) yields directly the appropriate results for the line 
case, 1.e., (72) and (73) to (74) of Section V. 


6.2 FSK spectrum with no line components: overlapping pulses, K = 2 


®.(n) = ((SzinSi) (di ]-bx)) (91) 
with 
bi, = 84,% Abs (92) 


We have noted in Section 4.4. that the present b; for rsx has identical 
form to the prior bz of Ref. 1 for psx. In Section 4.2, we saw that S; is 
identical for K = 1 and K = 2; therefore, (80) applies to the present 
case as well. Comparing the psk analysis of Section VII of Ref. 1, we 
evaluate (91) above by inserting (80) above inside the (_) in (87) of 
Ref. 1. 

For n = 0, (87) and (90) of Ref. 1 yield 


®.(0) = wl?l, (93) 
For n = 1, (87) and (91) of Ref. 1 yield 
®.(1) = ({a(T)-ae]} & ay X (ari) X ays} X acy) 


= ({q(T)-ax]-ax} & {axiil-arii} X arze]). (94) 
Since a;, az41, and a4. are independent, (27) and (28) yield 
(1) = {q(T)-wa} X wa X wi. (95) 


Finally, substituting (80) inside the ( ) of the third line of (87) of 
Ref. 1, 


&.(n) = (| TT aess-a(7)]| fa(T)-ax]} {q(1),- ang c]} {aean] ar} 


x fAninsi| anss) n= 2, (96) 


where we have split off the first two factors of (80). For n = 2, the 
first { }, containing the [J, is dropped. Regarding the factors q(T) - ax ] 
and q(T’) -az;1 | as complex numbers or alternatively as 1 X 1 complex 
matrices, and using the independence of the different a;,, (96) yields 


@e(n) = {weq(Z)]}"-*( {arin ]-a(2) ax] -ax} 
X {arpngel q(T): ar+i]- acts} ) 
= {w-q(T) }}"?{w]-q(T)-waf™!, n 22. (97) 
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The remainder of the analysis proceeds as in Section 6.1. The rsx 
spectrum for overlapping pulses with K = 2 is given by 


P.(f) = TRU) (A + A) RANI (98) 


where 


es2afT oe 
no Df erga x ws) 4 RD We 


=e Iw (7) 
(99) 


P,(f) again contains no spectral lines by (78); the limit |w;q(7) ]— 1 
again yields the appropriate results of Section V for the line case. 


6.3 FSK spectrum with no line components: overlapping pulses, general K 


Similar analysis yields the generalization to any overlapping signal 
pulses. From (46) and (60), 


#.(n) = ((SeynS%) (be J -by)) (100) 
with b; given by (46) and S; by (48) to (49). Thus, 


“ K+1 
IT ar-xen/ai'd ( - r) | k odd, 








IT Ak—(K/2)45°O (5 r)| ’ k even, (101) 
j=) 


Since the present b;, have identical form to b; of Ref. 1, (100) is 
evaluated by substituting (101) inside the «_ ) of (106) of Ref. 1, and 
making corresponding changes in the remainder of Section VII, Ref. 1. 
The factor 


II a;-q(Ux) ] (102)§ 


j=1-— 


is inserted inside the (_ ) of (107) of Ref. 1. In the following we recall 
that the factors of (102), aq] = ga], may be regarded alternatively 
as complex numbers or as 1 X 1 complex matrices. 


In (109) of Ref. 1, the first factor of the second line is modified as 


(ai) > g(Ux), (ai].ai) = g(U x): Wa; (103) 
consequently, 
®.(1) = {q(Ux)-wa} X whF-4 & wi. (104) 


In (110), Ref. 1, the first two factors of the fourth line are modified 
§ Ux is given by eq. (81). 
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as in (103), yielding 
®.(2) = {q(Ux)- wa} X wi?) X w]e, (105) 
By induction, 
®.(n) = {q(Ux)-wa}™ X wi X wl, nSK. (106) 
Next, for n = K, inserting (102) inside the (_ ) of the second line 
of (107) of Ref. 1, associating the factors of (102) with corresponding 
factors of the second [| of (107), Ref. 1, with n — K factors of (102) 


left over, and then noting that all factors have different indices and, 
hence, are independent, 


®,(n) = w]!*l-{q(Ux)-wa} *l{q(Ux)-w]}"-* 
= iw,q(Ux) ]}7-*tw]-q(Ux) wa}, n2K. (107) 


Finally, from (108) of Ref. 1 and (100) above, 


&.(0) = wh*. (108) 

The Fsk spectrum for overlapping pulses with general K is given by 
] 

Pf) = FRU): (A + AN-RANI, (109) 


where 


A = 5 wie _- > e—InanfT q(Ux) wh x whk-7] x w ff 

{e-**/7W] (Ux) wa} 
The condition (78) again guarantees no spectral lines; as equality is 
approached in (78) the present results approach those of Section V for 
the line case. The condition (63) is, therefore, necessary and sufficient 
for P,(f) to have line components. 


(110) 


Vil. ILLUSTRATIVE EXAMPLES 


The computation of the digital rm spectral density from the above 
methods is straightforward. For a given set of baseband signaling pulse 
shapes and symbol probability distribution, we determine K, the over- 
lap parameter, and the probability row vector.w, We then evaluate 
q(Ux) and w;q(Ux) ]. 

If |w,q(Ux)| <1, we know that there are no line components in 
P,(f). The continuous part of the spectrum is evaluated from the ap- 
propriate Hermitian form given in Section VI. 


If \w-q(Ux) ]| = 1, we know that there are line components in the 
spectrum, and also that 
q(Ux)] = e?*1-1], | fil S 3. (111) 


We determine f;, and then P,,(f) and P,,(f) from the methods given 
in Section V. 
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In the following examples, the digital computer is programmed to 
work directly with the Hermitian forms (both ordinary and Kronecker 
matrix multiplications are performed by the computer). In this way, 
complicated cases involving multilevel signal pulses overlapping 
several time slots may be simply treated. 

The case of rectangular-pulse rFsK modulation is treated in Ref. 8, 
and consequently will not be considered here.’ 

We make the following assumptions for convenience; none are 
essential. 


(z) The number of frequency levels is a power of 2, 
M = 2%, N an integer. (112) 
(it) The M baseband signaling pulses have a common shape; 
gu(t) = Cig (t). (113) 
(212) All signal pulses are equally likely; 


1 1 
where Iy is the identity matrix of order M. 


7.1 Raised-cosine nonoverlapping signal pulses: K = 1 


If the pulses have a common raised-cosine shape, 


Ai 
_ rfa|1 — cos = | O0O<tsT, fa>0, 
Amu 
g(t) | = 
0 |, otherwise, (115) 
where Ay, Ao, ---, Aw are the peak-frequency-deviation parameters of 
the FSK signals. We assume that A; = 1, Az = —1, As = 3, Ag = —3, 
-+, Aw = —(M — 1). 
Since K = 


R,(f) = T zx ie (ES) e- st f— (Fadel T 
sin x (JT — fait +- n) 
1 ( pr — er n) 


where J,,(xz) is the Bessel function of the first kind and of order n. 


» (116) 


§ This is the only discrete-frequency-modulation spectrum given in Ref. 3. 
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7.1.1 Raised-cosine signaling with no line spectrum: K = 1 


Since 
q(Uy) = etAitfaT eiderfaT ... eiAMrfaT (117) 


line spectra are absent if and only if f27' is not an integer. In this case, 


Pf) = Pa) = ARU) (A + AN RUD (118) 


(0.350) 


NO 


fyT=0.50 


/ 


a 
17 
7 0.75 
/ 
/ 
/ 
( 70.25 
/ 
oo 
(0.0171) 


4 
| 1.00 
“” 
7 
\ F [\ (0.00614) 
eee SZ JI. 
0 0.5 1.0 1.5 2.0 2.5 3.0 
{T 


SPECTRAL DENSITY P,(f)/T 





Fig. 2—Spectral density of binary rskK system with raised-cosine signaling and 
pulse duration 7. K = 1. 2f4 is the spacing between two adjacent a priori chosen 
frequencies. 
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(0.198) 


(0.194) 


SPECTRAL DENSITY P,(f)/T 


(0.0481) 





Fig. 2 (continued). 


where R(f) ] is ~ by ae and 
es? 1)-q(Uy), 
ial om} oy A sin Ma fal 
M sin rfaT 


For M = 2, 4, and 8 and various values of k = faT, P,(f) is plotted 
in Figs. 2, 3, and 4. 


(119) 


7.1.2 Raised-cosine signaling with line spectrum: K = 1 
The FM spectral density P, (f) contains lines if and only if |w,;q(U1) ]| 
= 1; that is, if and only if 


ioP S19, 8 ac, (120) 
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(0.0199) 


SPECTRAL DENSITY P, (f)/T 


(0.141) 


(0.140) 





0 0.5 1.0 1,5 2.0 2.5 3.0 


Fig. 3—Spectral density of quaternary rsk system with raised-cosine signaling 
and pulse duration 7. K = 1. 2f4 is the spacing between two adjacent a priori chosen 
frequencies. 
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(0.164) 


SPECTRAL DENSITY P,(f}/T 


(0.0991) 
(0.0969) 


fyT=2.00 





Fig. 4—Spectral density of octonary Fsk system with raised-cosine signaling and 
pulse duration T. K = 1.2f¢ is the spacing between two adjacent a priort chosen 
frequencies. 
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In this case, 


fi = 4, faT = 1,3,5,---, 
fi on 0, faT as 2, 4, 6, 


From Section V, 


P,(f) aa 


SPECTRAL DENSITY P,(f)/T 


1 1 9 = nN + fi 
ay ROI 5(J —Bak ) 
+ pRB + BRN! 


(0.221) 


_— fyT=0.125 


_- 0.350 
1.0 
_- 0.500 
0.5 
\ (9.568X 107°) 
LY (1.559X1078) 
a 7 NN 
lL % Z/7~| SS 
0 M aa 1.5 2.0 2.5 
fT 





(121) 
(122) 


(123) 


3.0 


Fig. 5—Spectral density of binary rsk system with raised-cosine signaling and 
pulse duration 27’. K = 2.2f4 is the spacing between two adjacent a priori chosen 
frequencies. 
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(0.182) 


— 
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q 
oc 
© 
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(0.0125) 
D</\ = 1.000 
0 1.5 2.0 2.5 3.0 
‘T 
Fig. 5 (continued). 
where$ 
B+ Bt = : l 
+ Bt = Wy i" — 77 Ee (124) 


For Jf = 2, 4, and 8 and for some integral values of faT, P,(f) is 
also plotted in Figs. 2, 3, and 4. 


§ Note that B + Bt for K = 1 is given in (74). 
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(4.064X 1072) 


vy faT=0.125 


(2.673X 107 2) 


SPECTRAL DENSITY P, (f)/T 


- (1.192X107-4) 





Fig. 6—Spectral density of quaternary rsk system with raised-cosine signaling 
and pulse duration 27. K = 2.2f4 is the spacing between two adjacent a priori 
chosen frequencies. 
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(3.206X 107 2) 


(2.87X107 2) 
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fyT=0.625 va 
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/ 1.98X107 2 
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SPECTRAL DENSITY P,,(f)/T 





Fig. 6 (continued). 


7.2 Raised cosine overlapping signal pulses: K = 2 


If a raised-cosine signal pulse just fills up two time slots, 


Ay 
As at 
rfa(1 + cos), —-T <tsT, fa>0 
Age 
g(t) | = 
0 |, otherwise, (125) 


K = 2, and the spectral density may be calculated from Sections 4.2, 
V, or 6.2 according to whether 


lwq(U2)]| = 1 (126) 
or | 
lw,q(U2)| <1. (127) 
If A, = 1, A», = —], A3 = 3, Aa = —3, etry Ayv-1= (M — 1), 
Au = —(M — 1), (116) can be shown to be satisfied if and only if 
2faT is an integer. Further, 
fp=0, a fal = 1,2;3, 2%, (128) 
and 
fi = 3, if faT = 3,3,3) °°°- (129) 
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In this case, note that 
1 sin (2MrfaT) 


w,;a(T)] = M sin QnfyT) (130) 


In this case, the FSK spectrum contains line components, and the con- 
tinuous and line spectra for /@ = 2 and 4 are shown in Figs. 5 and 6. 

If 2f2T is not an integer, the FsK spectrum does not contain any lines, 
and the continuous spectrum given by (97) to (98) for M = 2 and 4 
is also plotted in Figs. 5 and 6. 

Several observations can be made from Figs. 2 to 6. For both K = 1 
and K = 2, discrete spectral lines appear as fa7' approaches the limiting 
value and the power in the lines is substantial for K = 1 and M = 2. 
Also note that power in the line components with K = 2 is smaller 
than the power in the lines with AK = 1. 

For the same value of f27' and K, the principal portion of the spec- 
trum of binary FsK Is narrower than that of quaternary Fsk. For K = 1, 
quaternary FSK spectrum is narrower than that of octonary Fsk spec- 
trum for the same value of fa7’. 

Since lines can appear in the spectrum for a set of values of faT’, 
the FsK spectrum is quite different from the psk spectrum even when 
lines are present in the FSk spectrum. 


Vill. SUMMARY AND CONCLUSIONS 


Matrix methods are given to express the spectral density of a carrier, 
frequency-modulated by a random baseband pulse train, in a concise 
and computable form. Arbitrary pulse shapes may be used for M-ary 
digital signaling, and they may overlap over a finite number of signal 
intervals. 

The spectral density is expressed as a compact Hermitian form 
suitable for numerical computation by a digital computer. The com- 
puter is readily programmed to perform matrix operations directly, 
rather than expanding the Hermitian form and evaluating the indi- 
vidual terms. In this way, quite complicated cases involving multilevel 
signal pulses overlapping several time slots may be treated simply. 

Simple conditions in terms of the modulation parameters are given 
under which discrete spectral lines are present in the spectrum. A 
method is given to evaluate the power in the discrete spectral lines. 
The utility of the method is illustrated by giving several examples. 

The present results are restricted to independent signal pulses, 
except for the very special case of nonoverlapping (K = 1) signal 
pulses with line spectra present [see footnote following (72) ]. In Ref. 1 
we saw that for overlapping signal pulses with K = 2 (.e., signal 
pulses that occupy no more than two time slots), psK with correlated 
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signal pulses required the fourth-order statistics of the digital modula- 
tion process. FSK 1s more complicated; the correlated K = 2 case 
requires statistics of all orders for the modulation process, except in 
the line spectrum case. However, certain special cases with correlated 
signal pulses have been handled by extension of the present methods. 


APPENDIX 
Spectra of Complex and Real FSK Waves 


From (130), (131) of Ref. 1, Appendix A, we see that (13) of the 
present paper holds true if 


emtilb, .(, +) = 0, (131) 


where 
hy «(t, 7) = (v(t + r)v(t)) = (erletetr +e), (132) 


From (3) and (9), 


807) = (or exp Gepp ‘ [- | ha(u)ds.) )- 


(133) 


If ¢(0) is independent of the modulation parameters s;,, (133) 
becomes 


Byye(t, 7) = (226) (exp oe. i a [| hua(u)du.) 


(134) 


Thus, if (11) holds, the first factor in (134) = 0, and (131) and, hence, 
(13) follow immediately. 

If, instead, (10) holds, and imposing in addition the independence of 
the s,, (134) becomes 


Seo) = Tt Cex (i | [or + [| inde) 135) 


For large enough |¢| and finite pulse length, the integrals in the ex- 
ponent fall into three classes, depending on k: 


(t) Both limits on both integrals lie to one side of the signal pulses. 
The integrals equal 0 and the corresponding factor in the in- 
finite product equals 1 and, hence, may be ignored. 

(27) The limits on integrals straddle the pulse, and the limits may 
consequently be replaced by /%%, where the maximum pulse 
length is KT and Lx, Ux are given in (81). 

(277) The common lower limit, or one or both upper limits, lie eaihin 
the pulse. 


SIGNAL POWER SPECTRUM = 1123 


Therefore, for large enough |¢| (depending on the pulse length and 
on 7) 


ee ee ee ee ee (exp ( 2 f ~ hus(udda.) ) 
7 t sufficiently positive. (136) 
ee (es es gee ee (exp ( — 72 i a has(udda) ) 
, sifieiently negative. (137) 


Condition (12) may be substituted for condition (10) with only minor 
changes in the above discussion, and identical results (186) and (137). 


From (44), 
exp (; a 7 ha(u)ds 


aun = [or (: J, 0d) | (138) 


exp (jf, ‘ hac(u)du) 


the integrals in the exponents are the areas of the signal pulses. We 
consider the two cases of Sections V and VI: 


(2) All pulse areas are identical (mod 27); line components are 
present in the spectrum. 
(22) The contrary; no line components are present. 


In case (2), from (6) 
q(Ux) J = e?t-1), —-3< fi 3 3; (139) 


2rf, is the common area (mod 27) of all the signal pulses and 1] is the 
unit column vector of (23). It therefore follows that the final factors of 
(136) and (137) satisfy the following: 


U 
(exp (+ j2 i ° hus(u)ds.) ) = e+tfl, line spectrum present. (140) 
LK 





U 
(exp (+72 J hau) | < il, line spectrum absent. (141) 
LK 


With line spectra absent, (141), (136), and (137) show immediately 
that (131) holds for all f., and, hence, condition (10) or (12) guarantees 
the result of (13) for independent s, without further restriction. 

With line components present, (140), (136), and (137) show that 
®,,«(¢, 7) regarded as a function of ¢ contains a periodic (line) com- 
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ponent. Therefore, we write 


P,, yx GA T) = D, yx G. T)I a P, x(t, T)ey (142) 
where 
®,,«(t, 7), = 0, |#| sufficiently large, (143) 
and 
Dyyx(ty 7), = CMT) Sg, (pemenCtlD), (144)8 


Only the line component (144) can possibly contribute to the average 
on the left-hand side of (131): 


00 A 
es on(r) lim, “ [- 32m Sct (@fl+n)/T] tf 
= . sin 2x[2f, + (2f: + n)/T IA 
. lim — a 
ee. en(7) oe 2n[2f. + (2f; + n)/T JA 
The liima., — 0 if 2f. + (2f: + n)/T ¥ 0 for every integer n, thus 
satisfying (131). Hence, condition (10) or (12) and 


(145) 


2( +5) = @ n= 0,1,2,3,---, (146) 


guarantees the result of (18) for independent s;. Since, by (72), the line 
spectral components of v(t) occur at (f; + n)/T7', (146) is equivalent to 
requiring that the line components of v(t)e?*/«! and of v*(t)e722* fet 
never coincide. Note that condition (5) of Ref. 1 is the special case of 
(146) above for f; = 0. 

Finally, (135) to (140) of Ref. 1, Appendix A apply also in the present 
case, and establish that (13) is a good approximation if f, is high enough 
so that there is no significant overlap between the two terms of (18). 
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Analysis of Trunk Groups Containing Short- 
Holding-Time Trunks 
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This paper presents models for the behavior of trunk groups containing 
short-holding-time faulty trunks. The models, referred to as ordered selec- 
tzon, two-sided selection, random selection, or queuing selection, are 
applicable to selection procedures used by a number of switching systems. 
Each model is analyzed to obtain the fraction of group attempts carried 
on a faulty trunk in the group, and the corresponding fraction of group 
attempts that find all trunks busy (blocking or overflow). Numerical 
results for the basic models are also presented. The results indicate that 
factors such as the trunk selection procedure or the type of group (high 
usage or final) can lead to significant differences in the performance of a 
group containing a faulty trunk. 


l. INTRODUCTION 


A message trunk, the basic connecting link in the switched telephone 
network, provides the supervising, signaling, and ringing capabilities 
essential to call set-up, as well as the communication path. When a 
condition that prevents proper functioning of a message trunk occurs, 
and causes a call failure, the trunk is normally released by a switching 
system on customer abandonment of the failed attempt and is then 
available to fail another call. As a result, a single undetected faulty 
trunk can fail a disproportionate fraction of the offered attempts to a 
group. Figure 9 shows an illustrative case where one trunk, subse- 
quently verified to have been faulty, carried 35 out of 77 attempts 
offered to a trunk group during one hour. Because of their potential 
service impact, such trunks are of major concern throughout the Bell 
System and the object of many preventive maintenance and trouble 
detection programs. 

This paper presents models and analyses for the behavior of trunk 
groups containing short-holding-time trunks. This terminology is used 
to emphasize that the faulty trunks of interest are accessible by the 
customer, as opposed to faulty trunks that inhibit seizure by a 
switching system (false busys) or that result in an automatic retrial on 
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another trunk when an abnormal condition is detected. The models 
were developed to provide tools for quantifying the potential impact 
of short-holding-time trunks, to help determine more effective ways 
to use the trouble detection resources in the network. Although the 
main emphasis here is on the analytical development of the models, 
some numerical results are also presented. The reader primarily 
interested in these results can refer directly to Section III. (Necessary 
terminology and the relationship of the idealized models to switching 
systems are summarized in Section 2.1 and Table I, respectively.) 
A more detailed outline follows. 

The basic models studied, referred to as ordered selection, two-sided 
selection, random selection, and queuing selection, and the switching 
systems to which they primarily apply, are summarized in Table I. 
Justification for the choice of the models is given in Ref. 1. Basic 
assumptions are Poisson input, exponential holding times for both 
normal- and short-holding-time trunks,* and no retrials because of 
blocked or ineffective attempts.’ Within this framework, the impact 
of a short-holding-time trunk depends on its position in the group and 
on the way idle trunks are selected. Impact is quantified in terms of the 
fraction ineffective (fraction of offered attempts carried on the short- 
holding-time trunk) and the blocking (fraction of offered attempts that 
find all trunks busy in the group). These measures enable total ineffec- 
tives for both high usage groups (fraction ineffective) and final groups 
(fraction ineffective plus blocking) to be determined, as well as permit 
the distortion in standard traffic measurements (group usage and 
counts of offered and overflow attempts) owing to a short-holding- 
time trunk to be assessed. It should be noted that blocking, used here 
in a conventional traffic engineering sense, is used by some to refer 
to call failures resulting from any cause. 

In Section II, prior work is briefly discussed and analysis for the 
various models is developed. Terminology common to the models is 
summarized in Section 2.1. In Section 2.2, ordered selection is con- 
sidered. An exact solution for the fraction ineffective is obtained. The 
computations for the special case of one short-holding-time trunk are 
summarized in (14) to (18). Blocking is treated in an approximate way, 
using the equivalent random method. 

For two-sided selection (Section 2.3), the fraction ineffective is 
approximated by using results for ordered selection. The computa- 


* Although our main interest is short-holding-time trunks, some results hold for a 
short or long holding time on one trunk. In these cases, the trunk is referred to as 
abnormal. 

T The models, however, can be used with retrials, if these are also treated as Poisson 
(this approximation was found to be reasonable). However, the inclusion of retrials 
(which are an important factor in considering volume changes) does not substan- 
tially change the service impact of a short-holding-time trunk. 
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Table |— Summary of models for machine trunk selection 


ee Description Switching System” 
Ordered selection Fixed-order hunt for an idle Step-by-step, Panel, No. 
trunk. 4 Crossbar 
Two-sided selection | Traffic split into two parts. Each | No. 1 Crossbar, Cross- 
part uses a fixed-order hunt for bar Tandem, No. 4 
an idle trunk, with the two Crossbar (two-way) 


orders reversed. 
Random selection Equally likely choice of an idle No. 5 Crossbar 


trunk. 
Queuing selection Trunk that has been idle the No. 1 Electronic 
longest is chosen. switching system 


“Groups are assumed one-way (trunks are selected from only one end of the 
group), except for No. 4 Crossbar (two-way), where a two-way group between No. 4 
Crossbar systems is assumed. The applications are not always precise; trunk assign- 
ments to frames, gradings, and certain subgrouping arrangements can affect the 
selection procedure of some systems. 


tions summarized in (25) to (87) give good results for the fraction 
ineffective. Blocking is also treated in an approximate way. 

Random and queuing selection are considered in Section 2.4. It is 
shown that, for fraction ineffective and blocking, these are equivalent. 
Simple exact solutions involving the Erlang B formula are obtained. 
Equation (60) gives the fraction ineffective, and eq. (61) gives the 
blocking. 

Numerical results are given in Section III. For reasonable values 
of normal- to short-holding-time ratios as suggested by field data 
(about 5 to 30, depending on the type of fault and type of traffic), the 
results confirm that a single short-holding-time trunk can have a 
severe Impact on service. For example, Fig. 6 shows the impact of one 
short-holding-time trunk in a group with random/queuing selection. 
It is apparent that the short-holding-time trunk has a significant 
impact over a wide range of conditions. 

For any group, assuming that a short-holding-time trunk is equally 
likely to be in any position,” the various disciplines can be compared. 
We find that (see Fig. 8) 


Average 


.; Average Average 
fraction : : 
fraction fraction 
ineffective : : : 

ineffective ineffective 
for < s 
random or for for 
ine two-sided ordered 
q . 8 selection selection 
selection 


“For random/queuing selection, the position is irrelevant. 
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ie., the random/queuing selection tends to minimize the impact of a 
short-holding-time trunk. Of course, for a fixed position for a short- 
holding-time trunk, either ordered or two-sided selection can have a 
significantly lower fraction ineffective than random/queuing (e.g., 
last trunk for ordered selection). These results suggest that, if ex- 
pected service improvement is used as a criterion, then, all other 
things being equal, there is a higher service payoff in eliminating faults 
from trunk groups in older switching systems than in more modern 
systems. Conversely, the more modern systems provide better service 
for a given level of trunk faults. 

Section 3.5 briefly discusses practical limitations to the model 
assumptions and gives further consideration to Fig. 9. 


Il. ANALYSIS MODELS FOR GROUPS CONTAINING SHORT-HOLDING-TIME 
TRUNKS 


In this section, the equilibrium behavior of groups containing short- 
holding-time trunks is considered. Prior work in this area is limited. 
Klimontowicz’ appears to be the first to give the problem attention, 
considering random selection of idle trunks, ordered selection, and 
cyclic random (sequential with initial starting point chosen randomly). 
He develops some analytic results for special cases such as zero holding 
time on the abnormal trunk, but relies on simulation for most of his 
results. In support of work to detect faulty trunks from operator 
trouble reports,‘ Forys® has considered ordered selection for a trunk 
group with mean holding time dependent on trunk position. In this 
general case, the fraction of attempts carried on any trunk can be 
determined by applying renewal theory, but the results are numeri- 
cally inconvenient. (For completeness, these results are included in 
Section 2.1.) However, ordered selection with a single abnormal trunk 
permits application of a known recursion formula from Ref. 6, to give 
the computationally convenient solution developed in Section 2.1. 

The random selection model has also been considered in the context 
of fault detection from trouble reports.’ Except for limiting cases, 
analytic solutions were not obtained in Ref. 7; numerical solutions of 
the state equations to determine equilibrium occupancy probabilities 
were obtained. For a single abnormal trunk in the group, however, 
a simple exact solution to the state equations is possible. This is given 
in eq. (62). Subsequently, Kaufman® has shown that this solution 
can be generalized to an arbitrary number of abnormal trunks. How- 
ever, the fraction ineffective and the blocking can no longer be ex- 
pressed in terms of the Erlang B formula, which is the case for one 
abnormal trunk. 
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2.1 Terminology 


N Number of trunks in group, considered to be numbered from 


1, 2, ---, N. (For ordered selection, search for an idle trunk is 
from 1, 2, ---, N. For two-sided selection, one traffic parcel 
searches from 1, 2, ---, N, and the other from N, N — 1, ---, 1.) 


Kk =Number of the short-holding-time trunk. 


\ Arrival rate for the Poisson traffic offered to the group. 

u  Hang-up rate for normal trunks, i.e., the holding-time distribu- 
tion is F(x) = 1 — e-4*7,2 = 0. 

fi Hang-up rate for short-holding-time trunk. 

r Normal- to short-holding-time ratio (Z/,). 

a Normal offered load to the group (\/u). (For two-sided selec- 
tion, @ = a, + Q@2, where a, corresponds to the load offered in 
direction 1, ---, N and a2 the load offered in direction VN, N — 1, 
cacy, 


P Fraction ineffective (probability that an offered attempt is 
carried on the short-holding-time trunk). 

B Blocking* (probability that an offered attempt finds all trunks 
busy in the group). 


2.2 Analysis for ordered selectioni 


In this section, we derive results used to determine the effects of 
an abnormal trunk in a trunk group with an ordered selection of 
idle trunks. We first derive results for the case where each trunk has 
a different holding time. We then specialize our results to the case of 
one abnormal trunk because convenient computational algorithms 
are available for this case, and because it is the case of most interest. 

Of main interest is the proportion of offered calls serviced by each 
trunk (in equilibrium). Denote these by Px, K = 1, 2,---, N. The 
Pxs may be obtained by first calculating the stationary occupany 
probabilities. If px is the stationary occupancy probability of the 
Kth trunk, then, using (for example) Little’s Law, one can show that 


Px = uxpr/d, (1) 


where ux is the hang-up rate for trunk K. 

The method used to compute the pxs is conceptually straightfor- 
ward. The interarrival time distribution function of the traffic pre- 
sented to the Ath trunk is determined, and known results for a single 
trunk with renewal inputs are used to find px. 


“The term blocking is used without reference to whether a group is a high usage 
or final. Blocked attempts for finals are also ineffective attempts. 
t This section reproduces and extends some of the results in Ref. 5. 
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Thus, if the interarrival distribution function of the traffic into the 
Kth trunk is Ax(t), and 





ax(s) = [" ertdAx(®, (2) 
ak = in tdA x(t), (3) 

1 
pK > Gene (4) 


then, using the results in Ref. 10, p. 93, 


px = pxL\ — ax(ux) |. (5) 
Hence, the proportion of calls carried by the Ath trunk is 
Px = [1 — ax(ux) ]/axd. (6) 


Following the same argument as in Ref. 10, p. 37, we can obtain 


Ax(t) = i {exp (— wiz) + [1 — exp (ux_i1t) JAx(t — x) }dAx_i(2). 
(7) 


The idea behind the argument is to consider that an overflow from 
the (K — 1)st trunk occurred at time 0. In order for the next overflow 
to occur in less than t, we have two cases: an overflow occurs from the 
(K — 2)nd trunk at time z, 0 < x < ¢ and the (K — 1)st trunk is 
busy, or the (K — 1)st trunk was free at time z, and so the next over- 
flow from the (K — 1)st trunk must come in less than ¢ — z units of 
time. 
Since the input to the entire trunk group is Poisson, 


Aift) = 1—e™ (8) 


and 
nN 
ae (9) 


a1(s) = 


Taking the Laplace-Stieltjes transform of (7), we obtain 


aK—1(S + uK-1) 
1 — ax-1(s) + ax-1(s + wx-1) co) 


ax(s) = 
Equation (10) can be used to obtain ax(ux) for computing Px 
via (6). Unfortunately, this computation is quite cumbersome if K 


is large. 
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To evaluate ax, we make use of the fact that 


d 
axk=- qe OK (8) - (11) 
After some algebra, 
1 
aK = —————~ Gx_1 (12) 
OK—1(uK-1) 
and, hence, 
QK : (13) 


~ Nera (i1)or2 (2) * + x1 (Mx—1) 


where a;(u;) can be obtained from (10). Equations (6) and (138) com- 
bine to give the obvious result, 


— Jl — ai(u1) k=1 
Ce ae Gee eG IOS. E" 


1.e., the probability that an offered attempt is carried on trunk K is the 
probability it is blocked on the first K — 1 trunks times the probability 
it is carried on trunk K, given that it is offered to trunk K. The prob- 
ability that a call is blocked on the first A — 1 trunk is simply the 
product of the individual call congestions, i.e., Bx = Bxr—-iax(ux), 
where Bx is the blocking probability on A trunks, with Bo = 1. 

In the special case of un; = uw, 7 = 1,2, ---, AK — 1 and wx = Gi, we 
obtain from (10) and (13) 


14S (SoM) ails + ay + = Dw 
j=l J 


ax(s) K K 
1 + a (4) i806 + a) (6 + (7 — Le) 
]= 
aK >= L/rv\B(K = 1, a). (15) 
Here B(n, a) is the Erlang B formula for n trunks offered load a, 
(a = X/p). 


In this case, a simple recursion exists for ax(s). From Ref. 6 we can 
obtain 


rN )— J 
azi(s) = STATON De 5 ay Hana(s) for j SK. (16) 
There is also a simple recursion for B(j, a): 
oy oe j 
BA(j,@) = 1+ ae (17) 


with 
B(O, a) = 1. (18) 
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Use of recursion (16) with s = @ together with (17) makes the cal- 
culation of the fraction ineffective Px = B(K — 1, a)[1 — ax(f) | 
straightforward. 

The blocking B (the probability of a call finding all N trunks oc- 
cupied) can be obtained by calculating the load overflowing the last 
trunk. Thus, 

B= ay (un) /dAan. (19) 


Even in the case where there is only one abnormal trunk in the 
group, this calculation of B can be quite tedious. This is especially 
true in the case where a large number of trunks follow the abnormal 
trunk in the ordering. Instead of (19), we shall approximately calcu- 
late B for the special case un; = uw, 7 ¥ K. 

We consider offering the overflow from trunk K to a hypothetical 
infinite trunk group with normal holding times. The mean m and 
variance v of the number of occupied trunks in the infinite group can 
be calculated from (this follows from Ch. 3 of Ref. 11, or p. 36 of 
Ref. 10) 


m = 1/paxs1 (20) 
i aa — 
Using (13) and (10), we obtain 
m = ax(fi)/dxp = aB(K — 1, a)ax(@) (22) 
p= m[1—m 4+ eee |. (23) 


Recursions (16) (used with s = yp, @, » + @) and (17) again make the 
calculation of m, v straightforward. 

We now apply the equivalent random method.? That is, we approxi- 
mate the overflow from trunk K by the overflow from an “equivalent”’ 
trunk group having normal trunk holding times, and which produces 
the same m, v on a hypothetical infinite trunk group. To determine 
blocking, we proceed along the same lines as in the equivalent random 
method, giving 

BRnB(N-K+No™) /s, (24) 
where NV, is the number of trunks in the equivalent group and \, is 
the attempt rate for the traffic offered to the equivalent group. (This 
approximation is exact where one trunk follows trunk K. In other 
cases, small errors can be expected.) 

Although the results in this section have assumed that the input 
stream is Poisson, it is a simple matter to extend them to handle 
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peaked* (overflow) traffic. In particular, one can use the equivalent 
random method to model the peaked traffic as the overflow from a 
single trunk group offered Poisson traffic, append this ‘‘primary”’ 
group to the trunk group of interest, and assume an ordered hunt 
selection for the extended trunk group. The hunting is done first over 
the primary group. Thus, if the Ath trunk were faulty in the trunk 
group of interest and there were N, trunks in the primary group 
modeling the peaked traffic, the faulty trunk will now be in the 
(N,. + K)th position.‘ The proportion of calls carried by the faulty 
trunk is determined by solving the extended trunk group problem and 
scaling the resulting proportion up by the ratio of the mean traffic 
intensity of the input to the extended group to the mean traffic in- 
tensity to the original trunk group. 

This completes our treatment of ordered selection. Numerical results 
are given in Section ITI. 


2.3 Analysis for two-sided selection 


In this section, two-sided selection is considered. We derive approxi- 
mations for the fraction ineffective P and for the blocking B. 

First, consider the extreme case with trunk K having zero holding 
time. The fraction ineffective seen by each traffic is P;) = B(K — 1, a), 
P, = B(N — K, az), where B(-,-) is the Erlang B formula. This 
case motivates the approximation to be described which, in its simplest 
form, ignores interaction between the separated subgroups of good 
trunks, and in more refined form accounts for some interaction.* This 
approximation, which is quite accurate, was used because a computa- 
tionally feasible solution to the state equations proved difficult. 

The notation to be used in this section is indicated in Fig. 1: m,, v; 
are the mean and variance of the traffic offered to trunk K, and 
Cj, Vi; are the mean and variance of the “crossover traffic.”” To use the 
convenient recursions developed in the preceding section, only the 
mean of the crossover traffic is used. This has a negligible effect on 
the computation for P, but can have a somewhat larger effect on the 
computation for B, because the peaked crossover traffics would ex- 
perience a higher blocking than the Poisson traffies. However, their 
mean is normally small compared to the Poisson traffic, which reduces 
the effect of their peakedness somewhat. In particular, for a1 = ae 


* The peakedness factor z(u) of a traffic stream is the equilibrium variance-to-mean 
ratio of busy servers when this traffic is offered to an infinite group of exponential 
ee with service rate u. The peakedness factor is larger than one for overflow 
traffic 

T Since nonintegral NV, can occur in the equivalent random method, interpolation 
may be necessary. 

The only real difference is that the refined form requires iteration. 
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TRUNK K _ 

(HANG—UP RATE jL) 
ny = K-1 TRUNKS ih - a ng = N-K TRUNKS 
(HANG-UP RATE }t) (HANG—UP RATE p) 


a a 
peels e e oe es ee i = @ L ———— — —— @ tC ~~ 
| 
(Co4,Y21) | | (m2 v2) 


ay = ay + C04 l ag’ = a2 + C19 
| | 


Fig. 1—Terminology for two-sided selection. 


(the condition for which the approximation is intended) we usually 
have C12 K de, C21 <«K a,;. When this is not the case, such as a large 
trunk group with the short-holding-time trunk in the first position, 
so that c12 may be of the same magnitude as ae, then cz has a low 
peakedness. 

To describe the computations, assume that the c;; are given and 
independent of the a;. Thus, consider a load a; = a; + c;; (treated as 
Poisson) offered to a group of n; trunks with hang-up rate yp, overflow- 
ing to a trunk with hang-up rate @. From the results in Section 2.1, 
the mean and variance of the overflow (in terms of hang-up rate 7) 
can be determined. First, we define \,; = a;,/u and solve the recursion 


[see (16) ] 


a+r, p— ] 
ayl = z+ M+ (J ~ De =e Cee | ay, (25) 
Ni LY 
given 
eee (26) 
i+ hy 
for 7 = 1,2, --:,n; +1. The parameter @; = @n,41 is the call con- 


gestion on a (fictitious) trunk with hang-up rate @. The mean and 
variance of the offered load to this trunk is 


= = aB(ni, a) (27) 


5. 
| 





0, = mi( : — mi). (28) 


1 — a; 


We must now get the mean and variance of the overflow traffic corre- 
sponding to a;, i.e., for the traffic offered to trunk K. Since c;; is 
treated as Poisson, clearly 


m= : a;B(ni, a). (29) 
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To get the variance, we use a result from Ref. 12 on the peakedness 
of split overflow, which gives 


m1 4S / 


mM; a, 





ca 1). (30) 
Unless a; = aj, this gives a value less than v;,/m;. The total mean and 
variance of the offered traffic to trunk K is then 


m= m+ me (31) 
v= Uy + do. (32) 


To compute the call congestion for trunk K, we assume (as in the 
equivalent random approach) that m, v are the mean and variance of 
a renewal input to trunk K. If a(-) represents the Laplace-Stieljes 
transform of the interarrival distribution for this renewal process, then 
the call congestion is given by @ = a(g@). But for the renewal model, 





v= m/( z-m) (33) 
1-—@ 
and, hence, 

ME ed _ 

= ear ore ers iz ’ (Z = v/m). (34) 


This results in a fraction ineffective for the total offered traffic to the 
group given by 


rm 1 
— at+a.m+2 (35) 


and an approximation for blocking, 


p — “Ll — Bim, ai)] + asf — Bir, a2) 1 (36) 
Qi + Qs 








B=1- 


Finally, we require that the crossover traffics assumed be consistent 
with the mean call congestion on trunk K, 


marez-1. 
m+2z 


This condition can be met by a few simple iteration steps, beginning 
with c;; = 0, and using (37) to update the c,;. 

When condition (37) is satisfied, eqs. (85) and (86) give the basic 
results for the two-sided selection procedure. It should be noted that, 
in addition to the treatment of each crossover traffic as Poisson and 
independent of the Poisson traffic offered to the same subgroup of good 
trunks, a second (minor) approximation is implicit in the computa- 
tions. This is the use of the average call congestion in (37) to update 


Ci = a;B (ni, a1) (37) 
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each crossover traffic. It is a straightforward matter to determine a 
separate call congestion for each traffic offered to trunk K, but the 
approximation is sufficiently accurate without this refinement. Nu- 
merical results using the approximations and comparisons with exact 
solutions (for small problems) are presented in Section III. 


2.4 Analysis for queuing and random selection 


To derive results for the case where the idle trunks form a queue, 
we define a two-dimensional state that represents the number of idle 
trunks in the queue together with the position of the abnormal trunk 
in the queue. We then derive equations for the equilibrium proba- 
bilities of each state. One can show that these probabilities exist and 
are unique.” The equilibrium probability that the abnormal trunk is 
busy, denoted by #, is then simply a sum for states in which the ab- 
normal trunk is busy. As indicated in Section 2.2, it is easy to show 
that the mean number of requests serviced by the abnormal trunk in 
a unit of time is given by 


jp (38) 
and, hence, the proportion of requests serviced by the abnormal trunk is 
P = jip/v = rp/a. (39) 


In the course of deriving our results, we show that it is equally 
likely that the abnormal trunk occupies any position in the queue of 
idle trunks. This, however, is equivalent to a random selection of idle 
trunks as far as the fraction ineffective and blocking are concerned. 
Hence, the results derived for P, B also apply to the random selection 
model. Strictly speaking, the two selection procedures are not equiva- 
lent, of course. For example, with queuing selection, an abnormal 
trunk would not serve two successive attempts if other trunks were 
idle, but this is a possibility with random selection. Under other as- 
sumptions, e.g., dependent retrials or non-Poisson input, these differ- 
ences become important. 

To proceed with the derivation, define 


L;; = the event that there are 7 idle trunks in the queue and that 
the abnormal trunk is in the zth position in the queue. 
Pi; = Prob {E;;}. 


We let z = 0 denote that the abnormal trunk is occupied. Thus, 


N-1 


p = D) Po. (40) 
j=0 


“We have a continuous-time Markov process, with a well-behaved embedded 
Markov chain (see Ref. 13). 
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The following equations can be readily derived relating the P;,’s: 
La + (N — 1)uJPoo = APou + AP ui, (41) 


[A Neg pb alPu = AP tN = ePoger tt AP i 
for 1sSj<N-—1, (42) 


[a + \]Po,n-1 = \Pin + uPo,n-2, (43) 


L(V _ De ae NIP; _ (N _ J a L)pP 35-3 = AP ist p41 
for O<it< 78 N, (44) 


LN ea AIP = PP ogee AP Aig 
for O<j7 SN, (45) 


and, trivially, 
P;; = 0 for i>y, or 7 >N; 
Py; = 0 for qa. 


These equations appear quite formidable because of the apparent lack 
of simple structure. A brute force algebraic approach seems unfruitful, 
as does a generating function approach. 

Instead, we make the conjecture that 


Py; = Po = +++ = P3; for all 7 2 1. (46) 


We should note that (46) is equivalent to assuming that the idle 
trunk is selected at random. This is true because P;; represents the 
probability of seeing the state (7, 7) at a random instant in equilibrium 
and Poisson arrivals see the same distribution. 

We justify our conjecture by using (46) in (41) to (45) together - 
with the fact that the probabilities must add to 1 and showing that 
the resulting equations have a simple solution. This solution can be 
substituted into the original equations to show that we indeed have 
the correct solution. However, since there exists a unique equilibrium 
solution to these equations and we have, in effect, solved them by 
adding additional constraints, the proof is complete without this step. 

Proceeding in the manner described, we first use (46) in (41) to (45) 
and obtain the following equations: 


La + (N — 1)p]Poo = APa + APu, (47) 


La et) as T= 1)p — d |Po; = APo,j41 + (N - Q)uP o,j-1 Si MP 541,341 
for 1<sj<N-—1, (48) 


[i + \]Po,n-1 = \Pww + wPo,n-2, (49) 


LN = ge AIP y= GN = 94 DeP jet pea FAP Hae 
for 0<j<N, (50) 
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and 
L(N = Tu = a lage = BP o,j-1 + NP 541,541 
for O<jSN. (651) 


Putting 7 = N in eq. (50), recalling that Pyii.wii1 = 0, we get 
oN 
Py-i,v-1 = BTN = aPyn. (52) 


Repeatedly reducing 7 by 1 and solving for P,;_1,;-1 in terms of Pyw, 
we get 
qs 


Py = (wait ¥™ ] =a 1, Nc (53) 


Using (53) in (51) results in 


A 
Finally, from (47) we get 
h @r-! 


One can check that expressions (53), (54), and (55) satisfy eqs. (48) 
and (49), which proves the conjecture. 

We compute Pyy from the fact that the sum of the probabilities 
must be 1. This yields 





N 3 a . ° 
Pew = y| et + 5(F fede) te ti] 66) 
ji ! ji 








N|N! ve N 
Thus, 
N-1 qs 
7 > p La = 7! (57) 
Os i = a er TN 
j=0 ss ae JM 
N pri JF 
‘ p> = N - B 
and 
N-1 a! 
B im at 
p=?P__, re (58) 
nes l4 4) 
j=0 q} N rN 


recalling that r = fi/n. 
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Equation (58) can be rewritten in terms of Erlang’s loss function as 


N-1 qi 
= =e 
as: ae 1)y 
7=0 J! r jai (7 = 
N-1 ai 
= J=0 i! 
i N=l qi NaN 1 Nolg 
— + —— -~—] — 
BatartG— ye & F 
Thus, 


a+ Nr —ar[1 — B(N —1,a)] 


An alternative form of (59) can be obtained via the standard recur- 
sion for Erlang B [see (17) ], giving 


_ rf[1 — B(N, a) ] 
eek ee ee heme CET OL) 
To get an expression for the blocking B, notice that 
N-1 =\ (7 N—1 _ 
R=Px _ a Pet (\/a) (aN "/(N_— 1)! 
pam a Sele de) 
Ja0 UN AV rN 
or 
_ NBN, a) 

= ark Baa (61) 


Although (61) and (62) also apply to the random model, it should be 
noted that the equivalence between random and queuing selection 
extends to the equilibrium distribution for redefined states (1, j), 
where 2 is the number of good trunks occupied and 7 is 1 if the abnormal 
trunk is occupied, 0 otherwise. In fact, it is straightforward to show 
from (53) to (55) that the redefined probabilities must satisfy 


Pig = TT Pog 
ait! t= 0,1,---,N —1, (62) 
Pu = a eel 


which solve the somewhat simpler equations for the random model 
(these equations are given in Ref. 4). 
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Il. NUMERICAL RESULTS 


This section presents selected numerical results obtained with the 
solution procedures of Section II. 


3.1 Ordered selection (Figs. 2 to 5) 


Figure 2 gives the fraction ineffective for r = 5 and the blocking* for 
r=5, N =20. Even for this relatively small value of r, the short-holding- 
time trunk has a significant impact, which decreases as the position 
in the order of selection increases. For large values of offered load, the 
dependence of fraction ineffective on K (position of the short-holding- 
time trunk) is decreased. In fact, it follows from Little’s Law’ that, 
for all K (and all selection procedures), P S r/a, with the bound 
approached as the time congestion on the short-holding-time trunk 
approaches 1.0; 1.e., for very large a. For a = 25, K = 1, we note that 
P = 0.17, r/a = 0.2. 

The results for fraction ineffective do not depend on the total 
number of trunks N 2 K. For blocking, N is important. For N = 20, 
r = 5, blocking is decreased from Erlang B blocking by about 3 in the 
(design) range for 20 to 30 percent overflow, with the decrease rela- 
tively insensitive to K. Blocking results are approximate, with errors 
resulting only from the application of the equivalent random method. 
In cases where exact solutions were compared to the approximate, 
the agreement was good. The decreased overflow resulting from a 
short-holding-time trunk can contribute substantial errors in estima- 
tion procedures for which overflow enters. Usage measurements are 
also affected by a short-holding-time trunk, with mean carried usage 
given by aP/r + a(1 — P — B). 

Figure 3, for r = 15, is similar to Fig. 2, but with larger impacts 
and more spread between the curves. The ratio r = 15 is typical for a 
short-holding-time trunk and shows that P > 0.4 can easily occur, 
even for relatively large loads. The limiting ratior = © (zero-holding- 
time trunk) would result in P*® = B(K — 1,a), and B®? = 0, which 
are substantially different from the r = 15 results. For example, 
P*? = 1forK = 1. 


3.2 Two-sided selection (Figs. 4 and 5) 


Figure 4 gives the behavior for N = 5, r = 15, including a com- 
parison with the exact solution obtained by solving the state equations 
for the system. The approximation for P displays essentially no error 
for low blocking. This is to be expected for conditions under which the 


* Recall that blocking refers to the fraction of attempts that find all trunks busy in 
the group, whether or not it is a final or high-usage group. 


1142 THE BELL SYSTEM TECHNICAL JOURNAL, JULY-AUGUST 1975 


ERLANG B BLOCKING 





m—am—— AVERAGE OVER ALL K 





0 5 10 15 20 25 30 
OFFERED LOAD !IN ERLANGS 


Fig. 2—Behavior of ordered selection for r = 5, N = 20. 


offered load to trunk K comes from only one side (e.g., K = 1, low 
offered load), since the approximation for P becomes exact. For other 
low blocking conditions (such as K = 3, low offered load), no dis- 
cernible error indicates that modeling the traffics offered to trunk K 
as a renewal stream introduces essentially no error in the computed 
call congestion for trunk K. As the crossover traffies increase, and, 
hence, there is interaction between the separated subgroups of good 
trunks, the approximation shows some error. The error is well-behaved, 
and increases to only about 2 to 3 percent. For larger groups, the error 
is generally less and is not shown on the figures. 
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ERLANG B BLOCKING 


aero AVERAGE OVER ALL K 





0 5 10 15 20 25 30 
OFFERED LOAD IN ERLANGS 


Fig. 3—Behavior of ordered selection for r = 15, N = 20. 


Errors in the blocking are larger. Since peakedness is ignored in 
determinining the crossover traffics and the carried load on the good 
trunks, this is to be expected. However, the absolute error does not 
increase significantly with offered load. As the change from Erlang B 
blocking is the important factor in applications, this error behavior is 
acceptable. As for ordered selection, the blocking is relatively insensi- 
tive to K. 

The interesting behavior for K = 1 is apparently due to the size 
of the group. For K = 1, N 2 2, qualitatively, one would expect that 
the fraction ineffective seen by load a; monotonically decreases from 
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ERLANG B BLOCKING 


APPROXIMATE SOLUTION 
me «EXACT «SOLUTION 
moemm= AVERAGE OVER ALL K 





0 2 4 6 8 10 12 


OFFERED LOAD IN ERLANGS 


Fig. 4—Behavior of two-sided selection for N = 5, r = 15. 


1.0, while for ae it initially increases from 0, as the offered load in- 
creases. The exact solution for N = 5 confirmed this behavior. How- 
ever, the fraction ineffective seen by load a2 increases fast enough to 
create an increase in P, before its ultimate decrease. As a result, 
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~ 0.5 in the relatively wide range 0 to 10 erlangs. I’or larger values 
- of N, P was found to be monotonically decreasing for K = 1. On the 
other hand, for N = 2, P can easily be shown to initially increase. 
Since a; = de, for N = 2, two-sided and random selection are equiva- 
lent, and (59) applies. Thus, the conditions that produce the qualita- 
tive behavior of Fig. 4 are difficult to predict. 

Figure 5 illustrates the behavior for r = 15, N = 20. Compared to 
ordered selection, the impact is less, and the dependence on K less 
pronounced, especially for blocking. This is attributable to the diversity 
introduced into the selection procedure by two-sided selection. How- 
ever, a substantial overall impact is still evident. For N = 20, even 
for the most favorable case K = 10, for a design overflow of 20 percent, 
P 0.2, and overflow is reduced to 5 to 6 percent. 

The limiting case for r = © gives 


p» — BK — 1, 4/2) + BIN — K, a/2) 
= 


For N = 20, K = 1, this gives a value of 0.5 over the range 0 to 25 
erlangs, compared to the value of P = 0.29 for a = 25, r = 15. For 
N = 20, K = 10, P* = 0.385 at a = 25, or about 50 percent higher 
than the value for r = 15. 


3.3 Random/queuing selection (Figs. 6 and 7) 


In each case in Figs. 6 and 7, P begins at 1/N, then increases over 
the range of offered loads shown. The 1/WN initial behavior is intuitively 
obvious for random selection. Queuing selection, on the other hand, 
guarantees that, when a trunk becomes idle, trunks already idle must 
serve calls before it can be picked for service. For very low offered 
loads, a short-holding-time trunk essentially serves every Nth call 
to give the 1/N initial behavior. 

Analysis of eq. (59) indicates that P can have at most one extremum. 
For r > 1, this occurs at the unique root of the equation 


0 1 
5g (ell — BWW — J, a) J} = —- 


Thus, random/queuing selection never displays the complex behavior 
observed in Fig. 4 for two-sided selection. Because the extremum occurs 
for relatively large values of a, a short-holding-time trunk has a 
larger impact in a high usage group than in a final group of the same 
size. For example, for ten trunks, 4.5 erlangs is a typical offered load 
for a final, and 9.5 erlangs for a high usage group. From Fig. 6, P = 0.17 
at a = 4.5 erlangs, whereas P = 0.28 at a = 9.5 erlangs, 1e., the 
relative impact is larger. The time congestion aP/r for the short-holding- 
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Fig. 5—Behavior of two-sided selection for V = 20, r = 15. 


time trunk is 0.052 at a = 4.5 erlangs, and 0.175 at a = 9.5 erlangs, 
i.e., the absolute impact of the short-holding-time trunk is over three 
times as much for the high-usage application. In terms of the expected 
time congestion with a short-holding-time trunk equally likely in any 
position, this general behavior also holds for two-sided and ordered 
selection. This follows from the increase in average fraction ineffective 
as the offered load increases. 
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0 5 10 15 20 25 30 
OFFERED LOAD IN ERLANGS 


Fig. 6—Behavior of random selection for r = 15, N = 5, 10, 20. 
Figure 7 displays the behavior of random/queuing selection as r 


varies. In all cases, P is close to its maximum over a fairly wide range. 
The limiting case r = © gives [see (59) ] 


1 
N — afl — B(N — 1, a)] 


For N = 20, a = 25 erlangs, P* = 0.36, compared to the r = 20 


pe = 
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Fig. 7—Behavior of random selection for N = 20, r = 5, 10, 15, 20. 


value from Fig. 7 of P = 0.23. Since P is no more difficult to compute 
than P*, the upper bound P* is of limited use in this case. 


3.4 Comparison of selection procedures 


- To compare selection procedures, the measure taken is the expected 
value of P, denoted by P, given that a short-holding-time trunk 
is equally likely in any position. Computationally, it was found that 


Random 

- |Or | <p Two-sided < Pp Ordered 
queuing | ~— selection = selection } ’ 
selection 


with equality occurring only at a = 0. Figure 8 for N = 20, r = 15 dis- 
plays typical behavior. The differences are substantial. It is likely that 
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Fig. 8—Comparison of selection procedures for short-holding-time trunk equally 
likely in any position (NV = 20, r = 15). 


any improvement over the random/queuing behavior would require 
a selection procedure based on more information. For example, 
choosing the idle trunk whose last holding time was longest would 
further reduce the impact of a short-holding-time trunk. 

The expected blocking does not show much difference from one selec- 
tion procedure to another, but all disciplines show a substantial reduc- 
tion from the Erlang B results. 


3.5 Validation remarks 


The basic traffic assumptions of the model concern holding times 
and the arrival process. For normal trunk holding times (which result 
from a mix of conversations, busys, don’t answers), the exponential 
assumption is reasonable. For short-holding-time trunks, this assump- 
tion may be less valid. For example, trunks resulting in immediate 
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reorder returned to the customer would not likely display the expo- 
nential behavior. However, since the Erlang B formula holds for 
arbitrary holding-time distributions, it is reasonable to expect that the 
results should not be too sensitive to the form of the distribution. 

The Poisson assumption for the arrival process can be invalidated 
due to time variations, to peakedness (for groups receiving overflow 
traffic), or to retrial behavior. For ordered selection with K = 1, 
peakedness clearly reduces the impact of a short-holding-time trunk. 
This behavior may be reversed for higher values of K since peakedness 
increases the mean attempt rate to trunk K. For example, forr = ~, 
K 2 2, P is increased. However, several peaked traffic computations 
for r = 15 give values of the fraction ineffective that are very close 
to the Poisson value. 

Even if the Poisson assumption is valid with all trunks good, it can 
be violated in the presence of a short-holding-time trunk. The short 
time between a retrial and the failure that initiated it gives the retrial 
a high probability of meeting the same conditions as the initial failure. 
For two-sided, ordered, and random selection, this could increase the 
impact of the short-holding trunk over what would be expected from 
a Poisson assumption. 

The model is also susceptible to variations from the idealized selec- 
tion procedures. This holds particularly for applications to older 
systems where grading and multipling arrangements can lead to many 
different selection procedures. It also holds for applications to 5XB 
systems, where any irregularity in the distribution of trunks on the 
frames can distort the selection procedure from the idealized random 
selection model. In fact, individual circuit usage results would seem to 
indicate that significant discrepancies from (cyclic) random selection 
may be quite common in 5XB. Thus, for any specific application, 
suitability of the model would have to be determined. 

Figure 9 shows data obtained by special measurements from a Cross- 
bar Tandem group containing a short-holding-time trunk. The peg 
count and usage measurements during the study hour indicate that 
trunk 2 had a short holding time relative to other trunks in the 
group, and a failure condition was subsequently verified. Thus, all 
35 attempts on the trunk are assumed to have failed. The large im- 
balance in the offered attempts from either side of the group could be 
statistical, or it could be due to irregularities in incoming trunk 
assignment to frames (which determines the order of selection). It is 
more likely due to retrials. Thus, the reasonable assumption that 
a, = a» can be violated in some situations. | 

Since the measured hour was not the busy hour, the left side of the 
group can be treated like ordered selection. The estimate for a mean 
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Fig. 9—Example of a short-holding-time trunk. 


attempt rate for a Poisson offered traffic is } = 49 attempts/hour. For 
the offered load a in erlangs, a normal holding time is associated with 
each attempt, to give d = 3.3 erlangs. The ratio of holding times is 
r = 13. For these values, the ordered selection model with K = 2 
predicts P = 0.62. Thus, the realized value of ineffectives (35) is close 
to the average value (30.5) predicted from the same data. The dis- 
crepancy could be statistical, or to modeling the stream as Poisson, 
which smoothes out the retrial stream. 

The preceding very limited comparison is not a validation. Because 
of the various factors noted, a comprehensive validation of the models 
is difficult. Since the models are not intended for design purposes, but 
only to estimate impact of short-holding-time trunks, such validation 
has not been attempted. 


IV. CONCLUSIONS 

This paper has considered analytical models for groups containing 
short-holding-time trunks. The models confirm that these trunks can 
have a substantial impact on customer service. Although trafic and 
system characteristics can differ from those of the model, it is felt that 
the models are adequate for estimation purposes. 

The numerical results indicate that the type of system and type of 
group (high usage or final) lead to significant differences in performance 
in the presence of a short-holding-time trunk. These models and results 
are directly useful in devising optimum strategies for deploying main- 
tenance resources to minimize the service impact of short-holding-time 
trunks. 
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Linear predictive coding is an efficient method for transmitting the 
amplitudes of moving-area picture elements (pels) in a conditional re- 
plenishment coder for video-telephone signals. It has been conjectured that 
uf the linear predictor can dynamically adapt to the speed and direction of 
motion in the scene, then greatly improved performance should result. To 
test this conjecture and to get a first-order estimate of the possible saving, 
computer simulations were carried out using pairs of video-telephone 
Jrames stored on digital discs. Using this data, picture quality could not 
be studied. However, differential signal entropies could be estimated, and 
this was done for several nonadaptive and adaptive linear predictors. 
Entropies (in bits per moving-area pel) for adaptive linear predictors 
were significantly lower than for nonadaptive predictors, indicating that 
substantial bit-rate savings should be possible. However, simpler imple- 
mentations will have to be devised before adaptive prediction becomes 
practicable. 


l. INTRODUCTION 


In coding television pictures for transmission over a digital channel, 
it is well known that the required bit rate can be significantly reduced 
by removing various redundancies that exist in the signal, and in 
recent years methods for removing frame-to-frame redundancy have 
been investigated.! In a conditional replenishment? system, only the 
picture elements (pels) that have changed significantly since the 
previous frame are transmitted. Their amplitudes as well as their 
locations must be sent; however, most of the transmission capacity 
is used in sending the amplitudes. During periods of rapid motion, only 
every other moving-area pel need be transmitted, i.e., the moving area 
of the picture can be subsampled?“ at half-rate with the unsampled 
pels being replaced by the average of their neighbors. 


1155 


Linear predictive coding is an efficient method of transmitting these 
amplitudes. Channel rates of 1 bit per pel and below have been ob- 
tained.*~7 With this technique, a prediction is formed of each pel to 
be sent by computing a linear combination of previously transmitted 
pels. The difference between the actual value and prediction is then 
quantized and coded for transmission. Since the differential signal is 
small usually and large only occasionally, variable word-length coding 
can be used to good advantage in reducing the overall bit rate. 

The entropy of the quantized moving-area differential signal pro- 
vides an estimate of the average number of bits required to transmit 
a pel. Thus, it is a good yardstick for comparing the performance of 
various frame-to-frame predictive coders. The entropy will depend on 
the amount of detail (frequency and amplitude of brightness varia- 
tions) in the moving area of a frame as well as on the speed of move- 
ment in the scene. The overall bit rate, however, is strongly dependent 
on the number of pels in the moving area which, in turn, is deter- 
mined by the type of picture to be transmitted. See Ref. 8 for statistics 
on the number of moving-area pels per frame in typical video-telephone 
signals. | 

In Refs. 9 and 10, and by simple extension of the techniques of Ref. 
11, it is suggested that if the predictor can dynamically adapt to the 
speed and direction of motion in the scene, then greatly improved 
performance should result. For example, if an object is moving left 
to right at a speed of about 1 pel per frame period (PEF) then for each 
moving-area pel of the present frame a very good prediction should 
be obtainable by going back to the previous frame and looking 1 pel 
to the left. Other types of adaptive linear prediction are described in 
Refs. 12 to 14. They suggest that the weighting coefficients in the 
linear predictor be varied adaptively to make the differential signal 
smaller. 


ll. COMPUTER SIMULATION 


_ To get some comparison between nonadaptive and adaptive, frame- 
to-frame, linear predictors, computer simulations were carried out 
using about three dozen video-telephone picture sequences stored as 
8-bit pcom* on digital disc (two successive frames per sequence). With 
only two frames available per sequence, picture quality could not be 
studied. However, moving-area differential signal entropies could be 
estimated, and this was done for several nonadaptive and adaptive 
predictors. | 


* Characteristics: 30-Hz frame-rate, 271 lines, 2:1 interlace, 3 dB down at 1 MHz, 
2-MHz sampling-rate, 8 bits/sample, 210 visible samples/line. — 
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Frame-to-frame noise in the pictures was small—in most cases, less 
than 1.5 percent of black-to-white signal amplitude. Thus, detecting 
the moving-area pels was not difficult. This was done as follows: 


(2) Frame-to-frame differences larger in magnitude than 4 out of 
a possible 255 were detected. 

(72) If a significant change had two insignificant changes directly 
to the left and two insignificant changes directly to the right, 
or uf it had two insignificant changes directly above and two 
insignificant changes directly below, it was deemed to be 
insignificant, i.e., caused by noise rather than movement. 

(212) Finally, horizontal gaps of six pels or less between significant 
changes were deemed to be also in the moving area. 


This procedure defines the moving-area very well. See Ref. 4, Figs. 
1, 4, and 7 for pictorial examples. 

Figure 1 shows some of the pictures used. Figures la to 1c are scenes 
containing a Mannequin’s head, which could be moved horizontally at 
various speeds. The smaller the head size, the more detail there is in 
the moving area. Thus, with these scenes results could be obtained 
for various speeds and for various amounts of moving-area detail. 

About half of the picture sequences were of live subjects engaged in 
typical video-telephone conversations, such as shown in Figs. 1d and le. 
These scenes were important in comparing different linear predictors 
because they were more representative of what would normally be 
encountered in practice. The speed was not constant over the whole 
moving area as it was with the mannequin head, and there were more 
variations in lighting and picture detail. For these scenes the speed of 
movement could only be estimated to the nearest PEF (pels per frame 
period) by observing the frame-to-frame displacement of the edge of 
the moving area. 

In video-telephone scenes, speeds range from slow (0.5 PEF) to very 
fast (4 PEF). Very rapid movement is rare, however, and in such 
instances the viewer is less critical of picture quality since he is already 
accustomed to seeing blurred moving-areas in cinema and television 
pictures. 


Ill, NONADAPTIVE LINEAR PREDICTIVE CODING 


Figure 2 shows two successive frames with interlacing fields (two 
interlaced fields per frame). Suppose Z is a moving-area pel we wish to 
transmit. Pels A, B, C, G, and A are in the field presently being 
scanned; pels D, LE, F, R, S, and T are in the previous field; and the 
remaining pels are one frame period back from the present field. Pel 
M is the previous frame value of Z. The general linear prediction of 
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Fig. 1—Typical pictures used in the simulations. (a) Small head. (b) Medium head. (c) Large head. (d) and (e) Live subjects. 
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Fig. 2—Two successive television frames with interlacing assumed (two interlaced 
fields per frame). Pels Z and M are exactly one frame period apart. 


Z based on the previously transmitted pels, which are nearby both 
spatially and temporally, is given by 


Pz=aA +a2.B +030 + --- + anV, (1) 


where a1, @2, «++, @22 are the weighting coefficients. 

Since Z is 8-bit pcm and Pz is clipped to 8-bit pcm, the differential 
signal Z-Pz can assume any of 511 levels. The entropy of the 511-level 
signal is a measure of the relative performance of different predictors. 
However, in practice, a more coarsely quantized signal (consistent 
with acceptable picture quality) would probably be transmitted to 
reduce the overall bit rate. 

Picture quality could not be observed in the simulations. However, 
to get a rough estimate of bit rate for linear predictors with coarser 
quantization, a compromise 35-level* quantization scale was chosen 
that is slightly coarser than the quantizer of Ref. 4 for frame differ- 
ences and somewhat finer than the one used in Ref. 7 during periods 
of movement for element differences. Using this quantization scale, 
slope overload rarely occurs, and the predominant picture degradation 
is granular noise in the moving areas. This has been verified in recent 
preliminary simulations using thirty consecutive frames stored on 
digital disc. Using the nonadaptive and adaptive linear predictors 
mentioned in this paper, there is little difference in picture quality 
in the limited picture sample studied. However, much work remains to 
be done in this area. 


*On a scale of 0 to 255, the levels are: 0, +5, +14, +22, +30, +40, +50, +60, 
+70, +82, £94, +106, £118, +130, +142, 4154, +166, and +178. 
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The effects of the predictive coder feedback loop are ignored in 
the entropy measurements. Feedback can affect results to a significant 
degree if quantization is extremely coarse. Even so, the entropy mea- 
surements reported here are in close agreement with those of Refs. 
4 and 7. 

For various nonadaptive linear predictors, Figs. 3 to 8 show the 
entropy (bits per moving-area pel) of the differential signal as a func- 
tion of the speed of movement (pels per frame period or PEFs). Results 
are summarized in Table I. For these cases, more moving-area detail 
(smaller head) resulted in somewhat higher entropies, as would be 
expected, but moving-head results were still fairly close to each other. 
The live-subject entropies were close to the moving-head entropies for 
the most part, even though there was considerable variation in moving- 
area picture detail. 
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Fig. 3—Entropy of the frame difference signal in the moving area versus speed. 
Pz = M. Starting with 8-bit pcm as was done in these simulations the differential 
signal could assume any of 511 levels. Results are also shown for coarser quantization 
to 35 levels which still gives very good picture quality. Solid curves are for the man- 
nequin head at various distances from the camera. Unconnected points are for live 
subjects, such as in Figs. ld and le. 
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Table |— Entropies of some nonadaptive linear predictors 


Entropies in Bits Per 


Transmitted Signal Pz Movin -Area Pel 
4—Pz (See Fig. 2) (35-Level Quantization) 

Frame difference M = 2.1-3.9 
Element difference H = 2.0-3.7 
Element difference of frame 

difference M+H-L ~1.8-3.1 
Line difference of frame 

difference M+B-—J = 1.5-3.5 
Field difference (BE + 8)/2 ~1.8-3.2 
Element difference of field H+ (E+8)/2 

difference —(D + R)/2 ~1.5-2.5 


The frame-difference* entropy (Fig. 3) increases with speed as ex- 
pected. However, the element-difference entropy (Fig. 4) decreases as 
the speed increases, because of blurring introduced by the camera.’ 
It drops below the frame-difference entropy at a speed of about 1 
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Fig. 4—Entropy of the element difference signal in the moving area versus speed. 
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PEF.’ Subsampling at half-rate, however, causes it to rise above the 
levels shown in Fig. 4. 

The element difference of frame-difference entropy" and line differ- 
ence of frame-difference entropy’® (Figs. 5 and 6) are very close to 
each other even though with interlace (see Fig. 2) the previous line is 
further away from pel Z than is the previous element. D. J. Connor 
has pointed out that this occurs because movement in the scenes is 
mostly in the horizontal direction. However, the line-difference of 
frame-difference signal has the advantage of being unaffected by 
subsampling along the line. 

The field-difference entropy!’ (Fig. 7) is lower than the frame-differ- 
ence entropy, except at very slow speeds, because of the spatial and 
temporal closeness of the previous field pels. It compares well even 
with the double derivative signal of Figs. 5 and 6. The element differ- 
ence of field-difference entropy (Fig. 8) is smaller than any of the 
others and, other factors ignored, would be the logical choice in a non- 
adaptive, frame-to-frame, linear predictive coder. However, sub- 
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Fig. 5—Entropy of the element difference of frame-difference signal in the moving 
area versus speed. Pz = M+ H—L 
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Fig. 6—Entropy of the line difference of frame-difference signal in the moving 
area versus speed. Pz = M + B—-—J 


sampling along the line leads to a considerable increase in entropy, 
putting it above the line difference of frame-difference entropy. 
Furthermore, conditional vertical subsampling!*:* makes impracticable 
the use of any predictive coder that uses previous field pels. 


IV. ADAPTIVE FRAME-DIFFERENTIAL CODING WITH MOVEMENT 
COMPENSATION 
In Refs. 9 and 10, and by simple extension of the techniques of 
Ref. 11, frame-differential coding is adapted to the speed and direction 
of movement in the scene. Thus, if in Fig. 2 the speed of the moving 
object is about 1 PEF left to right, then pel L is a much better predic- 
tion of Z than J is. Similarly, if movement is 1 PEF right to left, then 
pel N is a better prediction. Such a coder must first estimate the veloc- 
ity (speed and direction) of the moving object and then transmit this 
estimate to the receiver before sending the quantized moving-area 
differential signal. 
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Fig. ara of the field-difference signal in the moving area versus speed. 
Pz = (E+ 8)/2 


Some computer simulations were carried out to test such schemes. 
First, each field was divided into 64 smaller blocks of 27 pels X 16 lines 
each, in an attempt to accommodate velocity variations within the 
field. Within a block the velocity was assumed constant. Then, for 
pel Z in the moving-area of a block, the 17 differences (see Fig. 2) 
Z-D, Z-E, ---, Z-Q were computed between Z and the six previous 
field pels and the 11 previous frame pels located relative to Z, as shown 
in Fig. 2. The magnitudes of these 17 differences were each summed 
over the moving area of the block. Following this, the 17 accumulated 
sums were examined to determine the smallest one. Then, for each 
moving-area pel Z, the pel in the relative position that yielded the 
smallest accumulated sum was used as a prediction within that block, 
and statistics of Z-Pz were measured. | 

This technique always gave a prediction which corresponded to the 
correct direction of motion, and was fairly accurate with regard to speed 
for speeds $2 per. However, for a given block, the accumulated sums 
were all quite close to each other, reflecting the high pel-to-pel correla- 
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Fig. 8—Entropy of the element difference of field-difference signal in the moving 
area versus speed. Pz = (HF + S)/2 +H — (D+ RB)/2. 


tion present in most television pictures. In low-detail moving areas, 
of course, tracking was less accurate; but in these regions choice of the 
pel to be used as a prediction was also less important since they all had 
approximately the same value anyway. 

Entropies of the differential signal Z-Pz are shown in Fig. 9, again 
with no additional quantization and with 35-level quantization. Re- 
sults are comparable to those for the element difference of field differ- 
ence, but they are much less regular. In addition, there is a sharp dip 
around 1 preF where, for the moving head at least, pel L (see Fig. 2) 
should be a near perfect predictor of pel Z. There is another less sharp 
dip near 2 per. These results are encouraging, but they indicate that 
movement at a nonintegral number of PEF should be handled in some 
other manner. 


V. MINIMUM MEAN-SQUARE-ERROR LINEAR PREDICTION 


In the general case of adaptive linear predictive coding, the weight- 
ing coefficients of eq. (1) are changed periodically to. obtain a small 
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Fig. 9—Entropy of the differential signal in the moving area when movement 
compensation techniques are carried out. In this case, Pz equals the pel in the previous 
field or frame that gives the smallest |Z — Pz| averaged over the moving area. A 
sharp dip in the entropy occurs near | PEF. 


entropy for the differential signal Z-Pz. If a block of pels is scanned 
before coding to derive an appropriate set of a’s, then these would 
have to be sent to the receiver, followed by the moving-area differential 
signals Z-P z. 

In general, it is not known how to find the a’s that minimize the 
entropy of Z-Pz. It is possible, however, to determine the a’s that 
minimize the mean squared value of Z-Pz. As will be seen later, the 
resulting entropy turns out to be smaller than with any of the pre- 
viously discussed methods. In fact, using search techniques, it is 
not possible to find a’s yielding entropies very much smaller than those 
obtained using minimum, mean-square-error (MSE), linear prediction. 

Let SW be the set of moving-area pels in the block of pels presently 
being scanned, and let Z; be the 7th member of OW. Let Ya = Az, 
Yio = B; Yi3,= Ci, -++, Yiog = V; be the pels located relative to Z,, 
as shown in Fig. 2. Then, to minimize the mean squared prediction 
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error over Si 1t 1s sufficient to minimize 
22 


Q= 2 (4: — Pz) = U (Zi - 2, a5¥ 3)’. (2) 
Set the partial derivatives with respect to a1, a2, --+, @22 all equal to 
zero, l.e., fork = 1, 2, ---, 22 
dQ 22 
Se Dy Ase = 2 aj Vii) Vs = 0. (3) 
OO}, i j=l 


In matrix notation, these simultaneous equations can be written 
Qa = B, (4) 
where the 7kth element of the square matrix @ is 


DL VaiY ua, (5) 


the kth element of the column matrix @ is 


se Lil ik, (6) 


and the kth element of the column matrix @ is ax. 

Matrix @ is symmetric, and if it has an inverse, eq. (4) has a unique 
solution. Otherwise, many solutions exist. Furthermore, any solution 
to (4) will minimize the Q of (2) since Q is a convex downward function 
of a. 

This procedure should take advantage of many types of redundancy 
in the picture. By tracking movement in the scene, frame-to-frame 
redundancy is removed. By using pels in the present and previous field, 
intraframe redundancy is also removed. 


VI. ADAPTIVE CODING USING ONE SET OF a’s PER FIELD 


In the simulations to be described next, a set of a’s was chosen to 
minimize the squared differential signal averaged over the moving area 
of the entire field. That is, in the preceding equations, summations 
with respect to 7 were over the entire moving area of each field. In this 
case the a’s can be sent using negligible channel capacity. 

Entropy results are shown in Fig. 10. Values are significantly less 
than all of the previously described results. The live subject results 
are not as good as those of the moving head, however, mainly because 
of the velocity variation within the moving area of a field. At speeds 
above about 2 rrr, results are surprisingly good given the pel con- 
figuration of Fig. 2, which would, at first, be expected to encompass 
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Fig. 10—-Entropy of the differential signal in the moving area when minimum MSE 
linear prediction is carried out. In each field Pz is the linear combination of the 22 
pels shown in Fig. 2, which minimizes (2 — Pz)? averaged over the moving area. 
Entropies are significantly smaller than any of the previous results; however, live 
subject results are, in general, above the moving area results. : 


only speeds less than 2 prr. This may happen because at the higher 
speeds, significant blurring is introduced by the camera making 
interpolative predictions more efficient. 

Figure 11 shows a representative set of derived weighting coefficients 
(multiplied by 100) arranged in the configuration of Fig. 2. The speed 
of the moving head is 0.63 prEr, left to right. Thus, the point on the 
moving object which is now at pel Z is in the previous frame 0.63 
pel to the left of M. The linear predictor places very little weight on 
pels K, N, Q, and V, since motion is from left to right. It interpolates 
between pels L and M as it should for this speed, but numerous other 
differentiations are also present. For example, a form of element- 
difference of field-difference prediction is attempted in the present 
frame. Some weight is also placed on previous line pels. 

A question which arises at this point is: how much of the removed 
redundancy is frame-to-frame, and how much of it is intraframe? 
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Thus, some simulations were carried out where the predictor was 
only allowed to use pels from the previous frame in its prediction. This 
is done by deleting in eq. (4) the rows and columns of matrix @ and 
the elements of column vectors a and @ that correspond to the unused 
pels. Entropy results are shown in Fig. 12. 

The results are practically the same as in Fig. 10 up to a speed 
of about 1.38 PEF which is nearly the range of speeds for which one 
should expect good performance, given the configuration of Fig. 2. 
Thus, most of the redundancy removed is frame-to-frame redundancy. 
Intraframe redundancy becomes important only when the speed is 
outside the tracking range of the algorithm. 

Some simulations were also carried out to test the effect of sub- 
sampling at half rate. In this case the predictor is only allowed to use 
pels B, D, F, G, J, M, P, R, T, U, and V in the prediction. Entropy 
results are shown in Fig. 13 for the speed range of 1 to 4 PEF, which 
is Where subsampling would normally be carried out. 

Subsampling increased the entropy as one would expect, but the 
amount of the increase depended on how much detail there was in the 
moving area of the scene. With 35-level quantization, subsampling 
increased the entropies by factors ranging from 1.5 to 2.3 for the large 
head, 1.7 to 2.9 for the medium head, and 3.2 to 4.6 for the small head. 
As the size of the head decreases, the amount of detail increases and, 
thus, so does the detrimental effect of subsampling. Of course, if the 
entropy is more than doubled by subsampling, then subsampling at 
half rate does not pay. 
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Fig. 11—A representative set of 22 minimum MsE weighting coefficients multiplied 
by 100 (see Fig. 2). The mannequin head was moving at 0.63 per. Thus, a moving 
object point which is at Z in the present frame was 0.63 pels to the left of 17 in the 
previous frame. 
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Fig. 12—Entropy of the differential signal in the moving area with minimum MSE 
linear prediction; Pz is a linear combination only of the 11 pels in the previous frame. 
Results are about the same as those in Fig. 10 up to a speed of 1.3 per. Thus, in this 
range, the coder is removing mostly frame-to-frame redundancy. 


Vil. SUBDIVISION OF THE FIELD INTO SMALLER BLOCKS 


In Fig. 10, where one set of weighting coefficients is used for the 
entire moving area of the field, the entropies corresponding to the live 
subjects are, in general, larger than those of the moving head because 
the head moves with pure translation, whereas different parts of live 
subjects move with different velocities. Velocity variations within a 
field can be accommodated by first dividing the field into smaller 
blocks, and then using a different set of weighting coefficients within 
each block. The @’s which give the minimum mean square prediction 
error over the moving area of a given block can be computed from eq. 
(4) exactly as before. 

Some simulations were carried out to determine the effect of using 
smaller blocks. A slightly different set of pictures was used for these 
computations than was used for the previous results. The mannequin 
head was slightly larger than in Fig. 1b, but the live subjects were 
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Fig. 13—Entropy of the differential signal in the moving area with minimum MSE 
linear prediction with subsampling, 1.e., Pz is a linear combination only of pels B, 
D, F, G, J, M, P, Rk, T, U, and V. 


about the same as in Figs. 1d and le. It was found in the simulations 
that the determinant of matrix @ in eq. (4) was frequently zero, indi- 
cating nonunique solutions, i.e., one or more of the a’s could be chosen 
arbitrarily. In this case, the arbitrary a’s were set equal to zero and, 
whenever possible, they were chosen to correspond to pels in the 
present frame. . 

Figure 14 shows entropies for 511-level differential signals. The 
uppermost curve and its associated live-subject points are obtained 
with no subdivision, as in Fig. 10. The single points are all above the 
moving-head curve, as before. 

The middle curve and points are the entropies which result from 
subdividing the field into 120 blocks (10 horizontal X 12 vertical) and 
using a different set of weighting coefficients for each block. Each block 
is 21 pels X 10 lines. Not only are the single points closer to the moving- 
head curve, but all of the points are considerably below those obtained 
using no subdivision. Using smaller blocks, therefore, not only ac- 
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Fig. 14—Entropy of the differential signal in the moving area with minimum MSE 
linear prediction and prior subdivision of each field into smaller blocks. A different 
set of weighting coefficients is used in each block. As the block size is reduced, the 
live-subject results come closer to the moving-head results because variations in 
velocity within a field are accommodated. In addition, all entropy figures are reduced 
because more intraframe redundancy is removed. 


commodates variations in velocity within the field, but it also removes 
some in-frame redundancy that could not be removed previously. 
There is a penalty to pay, however. The a’s of each block in which 
there is movement must be transmitted to the receiver so that it can 
correctly decode the received differential signal. If the e’s are sent via 
8-bit pcmM, and movement occurs in 50 percent of the blocks, then 
22 X 8 X 60 = 10,560 bits (or about 0.84 bit per moving-area pel) 
must be transmitted just to send the a’s. Clearly, some alternate 
method of transmitting the a’s should be used. For example, they 
might be estimated from the minimum MSE a’s of some nearby pre- 
viously transmitted block, or the vector a might be chosen from some 
predefined finite set of vectors known both to the coder and decoder. 
With even smaller blocks the entropies are further reduced. The 
bottom curve and points of Fig. 14 result from subdividing the field 
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into 552 blocks (23 horizontal X 24 vertical). Each block is 9 pels X 5 
lines. Coarse quantization of the differential signal in this case gives 
zero nearly always. Thus, nearly all of the information about the pels 
is carried in the weighting coefficients which, with 50 percent of the 
area moving, require 48,576 bits for transmission using 8-bit pcm. So 
if the blocks are made too small, the algorithm is inefficient because 
of the bits required to send the a’s. 


Vill. CONCLUSION 


In a conditional replenishment system for transmitting video-tele- 
phone pictures, the moving-area picture elements can be coded for 
transmission in many ways. Sending frame-to-frame differences and 
subsampling at half rate during active movement is simple, reasonably 
efficient, and does not fail catastrophically with an occasional trans- 
mission error. Sending line-to-line differences of frame-differences and 
subsampling during active motion is even more efficient and is rea- 
sonably simple, but it is somewhat less impervious to transmission 
errors. 

Adaptive linear predictive coding, which takes into account the 
speed and direction of movement in the scene, results in much greater 
coding efficiency, indicating that substantial bit-rate savings should 
be possible. However, system complexity and vulnerability to trans- 
mission errors are increased accordingly. This means that some kind 
of error detection-correction scheme is absolutely necessary if adaptive 
coding is to be used. 

Adaptive coding, which handles speeds of a nonintegral number of 
PEFs, performs better than if an integral number of PEFs is assumed. 
Relatively good results are obtained if for each block in the picture the 
weighting coefficients are chosen to minimize the mean squared moving- 
area differential signal. This technique requires transmission in some 
way of the weighting coefficients so that the decoder can correctly 
decode the received differential signal. Thus, if the block size is too 
small, then the system is inefficient because of the number of bits 
required to send the coefficients. However, even using only one set of 
weighting coefficients per field (block size = entire field), significant 
reductions in differential signal entropies are measured compared 
with nonadaptive coding. 

Much work remains to be done before these techniques can be used 
in a practical coder. For example, the subjective aspects of interframe 
coders which track moving objects are as important as the statistical 
aspects which are discussed in this paper. Also, many short cuts in 
implementation must be developed before these methods are economi- 
cal. Nevertheless, these results provide a valuable yardstick with which 
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simpler and more practical adaptive frame-to-frame coders can be 
compared. 
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